Gene Expression Profiles in Normal and Cancer Cells

This invention was made with support from the National Institutes of 
Health, Grant No. GM07309, CA57345, and CA62924. The U.S. government 
therefore retains certain rights in the invention.

TECHNICAL FIELD OF THE INVENTION

This invention is related to the diagnosis of cancer, and tools for 
carrying out such diagnosis. 

BACKGROUND OF THE INVENTION

Much of cancer research over the past 50 years has been devoted to the 
analyses of genes that are expressed differently in tumor cells compared to their
normal counterparts. Although hundreds of studies have pointed out 
differences in the expression of one or a few genes, no comprehensive study of 
gene expression in the cancer cell has been reported. It is therefore not known 
how many genes are expressed differentially in tumor versus normal cells, 
whether the bulk of these differences are cell autonomous rather than being
dependent on the tumor microenvironment, and whether most differences are 
cell-type specific or tumor specific. Thus there is a need in the art for 
information on the molecular changes that occur in cells during cancer 
development and progression. 

SUMMARY OF THE INVENTION

According to one embodiment of the invention, a method is provided 
for diagnosing colon cancer in a sample suspected of being neoplastic. The 
method comprises the steps of: 

comparing the level of at least one transcript in a first sample of
a tissue to a second sample, wherein the first sample is of a colonic tissue
suspected of being neoplastic and the second sample is of a normal human
colonic tissue, and wherein the transcript is identified by a tag selected from the
group consisting of those shown in Table 3;

identifying the first sample as neoplastic when the level of the
at least one transcript is found to be lower in the first sample than in the second
sample.

According to another embodiment of the invention, another method is 
provided for diagnosing colon cancer in a sample suspected of being neoplastic. 
The method comprises the steps of:

comparing the level of at least one transcript in a first sample of
a tissue to a second sample, wherein the first sample is of a colonic tissue
suspected of being neoplastic and the second sample is of a normal human
colonic tissue, and wherein the transcript is identified by a tag selected from the
group consisting of those shown in Table 2;

identifying the first sample as neoplastic when the level of the
at least one transcript is found to be higher in the first sample than in the
second sample.
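
As a purely illustrative sketch, and not part of the methods as described, the comparison step of the two colon cancer embodiments above can be written in Python. The function name, the dictionary representation of transcript levels, and the placeholder tag names are assumptions introduced here; actual tags would be taken from Table 2 (higher in neoplastic tissue) or Table 3 (lower in neoplastic tissue).

```python
from typing import Dict, Iterable, List


def flag_transcripts(first: Dict[str, float],
                     second: Dict[str, float],
                     tags: Iterable[str],
                     direction: str) -> List[str]:
    """Return the tags whose level in the first (suspect) sample is higher or
    lower than in the second (normal) sample, as requested by `direction`.

    `first` and `second` map a tag name to a measured transcript level.
    Tags missing from a sample are treated as level 0 (an assumption).
    """
    if direction not in ("higher", "lower"):
        raise ValueError("direction must be 'higher' or 'lower'")
    flagged = []
    for tag in tags:
        a = first.get(tag, 0.0)
        b = second.get(tag, 0.0)
        if (direction == "higher" and a > b) or (direction == "lower" and a < b):
            flagged.append(tag)
    return flagged


# Hypothetical levels; "tag_a" and "tag_b" are placeholders, not real SAGE tags.
suspect = {"tag_a": 10, "tag_b": 1}
normal = {"tag_a": 2, "tag_b": 5}
print(flag_transcripts(suspect, normal, ["tag_a", "tag_b"], "higher"))
print(flag_transcripts(suspect, normal, ["tag_a", "tag_b"], "lower"))
```

The sketch applies a bare greater-than or less-than test because that is how the comparison is worded above; any threshold or statistical test would be an additional design choice.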

In another embodiment of the invention an isolated and purified human
nucleic acid molecule is provided. The molecule comprises a SAGE tag
selected from SEQ ID NO: 1-732.

In yet another aspect of the invention an isolated nucleotide probe is
provided. The probe comprises at least 12 nucleotides of a human nucleic acid
molecule, wherein the human nucleic acid molecule comprises a SAGE tag
selected from SEQ ID NO: 1-732.

According to another aspect of the invention a method is provided for
diagnosing pancreatic cancer in a sample suspected of being neoplastic. The
method comprises the steps of:

comparing the level of at least one transcript in a first sample of
a tissue to a second sample, wherein the first sample is of a pancreatic tissue
suspected of being neoplastic and the second sample is of a normal human
colon tissue, wherein said transcript is identified by a tag selected from the
group consisting of those shown in Table 4;

identifying the first sample as neoplastic when the level of the 
at least one transcript is found to be higher in the first sample than in the 
second sample. 

According to still another embodiment of the invention a method of
diagnosing cancer in a sample suspected of being neoplastic is provided. The
method comprises the steps of:

comparing the level of at least one transcript in a first sample of
a tissue to a second sample, wherein the first sample is of a tissue suspected of
being neoplastic and the second sample is of a normal human tissue, wherein
said transcript is identified by a tag selected from the group consisting of those
shown in Table 5;

identifying the first sample as neoplastic when the level of the 
at least one transcript is found to be higher in the first sample than in the 
second sample. 

According to another embodiment of the invention a method is 
provided to aid in the determination of a prognosis for a colon cancer patient. 
The method comprises the steps of: 

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of a neoplastic colonic 
tissue and the second sample is of a normal human colonic tissue, and wherein 
the transcript is identified by a tag selected from the group consisting of those
shown in Table 3;

determining a poorer prognosis if the level of the at least one
transcript is found to be lower in the first sample than in the second sample.

According to another aspect of the invention a method to aid in
determining a prognosis for a patient with colon cancer is provided. The
method comprises the steps of:

comparing the level of at least one transcript in a first tissue
sample to a second sample, wherein the first sample is of a colonic cancer
tissue and the second sample is of a normal human colonic tissue, and wherein
the transcript is identified by a tag selected from the group consisting of those
shown in Table 2;

determining a poorer prognosis if the level of the at least one 
transcript is found to be higher in the first sample than in the second sample. 

In yet another embodiment of the invention a method is provided for
diagnosing colon cancer in a sample suspected of being neoplastic. The
method comprises the steps of:

comparing the level of expression of at least one protein in a
first sample of a tissue to a second sample, wherein the first sample is of a
colonic tissue suspected of being neoplastic and the second sample is of a
normal human colonic tissue, and wherein the protein is encoded by a transcript
identified by a tag selected from the group consisting of those shown in Table
3;

identifying the first sample as neoplastic when the level of 
expression of the protein is found to be lower in the first sample than in the 
second sample. 

In another aspect of the invention a method of diagnosing colon cancer
in a sample suspected of being neoplastic is provided. The method comprises
the steps of:

comparing the level of expression of at least one protein in a
first sample of a tissue to a second sample, wherein the first sample is of a
colonic tissue suspected of being neoplastic and the second sample is of a
normal human colonic tissue, and wherein the protein is encoded by a transcript
identified by a tag selected from the group consisting of those shown in Table
2;

identifying the first sample as neoplastic when expression of the 
protein is found to be higher in the first sample than in the second sample. 

According to another embodiment of the invention a method is 
provided to aid in determining a prognosis of a patient having pancreatic 
cancer. The method comprises the steps of: 

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of a neoplastic 
pancreatic tissue and the second sample is of a normal human colon tissue, 
wherein said transcript is identified by a tag selected from the group consisting 
of those shown in Table 4;

determining a poorer prognosis if transcription is found to be 
higher in the first sample than in the second sample. 

In yet another aspect of the invention a method to aid in providing a 
prognosis for a cancer patient is provided. The method comprises the steps of:

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of a neoplastic tissue 
and the second sample is of a normal human tissue of the same tissue type, 
wherein said transcript is identified by a tag selected from the group consisting
of those shown in Table 5;

determining a poorer prognosis if transcription is found to be 
higher in the first sample than in the second sample. 

According to still another aspect of the invention, a method is provided 
for diagnosing pancreatic cancer in a sample suspected of being neoplastic. 
The method comprises the steps of: 

comparing the level of expression of at least one protein 
encoded by a transcript in a first sample of a tissue to a second sample, wherein 
the first sample is of a pancreatic tissue suspected of being neoplastic and the 
second sample is of a normal human colon tissue, wherein said protein is
encoded by a transcript identified by a tag selected from the group consisting
of those shown in Table 4;

identifying the first sample as neoplastic when expression of the 
protein is found to be higher in the first sample than in the second sample. 

According to yet another aspect of the invention a method is provided 
for diagnosing cancer in a sample suspected of being neoplastic. The method 
comprises the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
tissue suspected of being neoplastic and the second sample is of a normal 
human tissue, wherein said protein is encoded by a transcript identified by a tag
selected from the group consisting of those shown in Table 5;

identifying the first sample as neoplastic when expression of the 
protein is found to be higher in the first sample than in the second sample. 

In still another embodiment of the invention a method is provided to aid 
in the determination of a prognosis of a colon cancer patient. The method 
comprises the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
neoplastic colonic tissue and the second sample is of a normal human colonic 
tissue, and wherein the protein is encoded by a transcript identified by a tag 
selected from the group consisting of those shown in Table 3;

determining a poorer prognosis if the level of expression is 
found to be lower in the first sample than in the second sample. 

In still another embodiment of the invention a method is provided to aid 
in determining a prognosis for a patient with colon cancer. The method 
comprises the steps of: 

comparing the level of expression of at least one protein in a 
first tissue sample to a second sample, wherein the first sample is of a colonic 
cancer tissue and the second sample is of a normal human colonic tissue, and

WO 98/53319 PCT/US98/10277

wherein the protein is encoded by a transcript identified by a tag selected from
the group consisting of those shown in Table 2; 

determining a poorer prognosis if the level of expression is 
found to be higher in the first sample than in the second sample. 

In still another aspect of the invention a method is provided to aid in
determining a prognosis of a patient having pancreatic cancer. The method
comprises the steps of:

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
neoplastic pancreatic tissue and the second sample is of a normal human colon
tissue, wherein said protein is encoded by a transcript identified by a tag
selected from the group consisting of those shown in Table 4;

determining a poorer prognosis if the level of expression is 
found to be higher in the first sample than in the second sample. 

According to even a further aspect of the invention a method is
provided to aid in providing a prognosis for a cancer patient. The method
comprises the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
neoplastic tissue and the second sample is of a normal human tissue of the same
tissue type, wherein said protein is encoded by a transcript identified by a tag
selected from the group consisting of those shown in Table 5;

determining a poorer prognosis if the level of expression is 
found to be higher in the first sample than in the second sample. 

In still another embodiment of the invention a method of treating a
cancer cell is provided. The method comprises the step of:

administering to a cancer cell an antibody which specifically
binds to a protein encoded by a transcript identified by a tag selected from the
group consisting of those shown in Tables 2, 4, and 5, wherein the antibody is
linked to a cytotoxic agent.

In another aspect of the invention an antibody linked to a cytotoxic
agent is provided. The antibody specifically binds to a protein encoded by a
transcript identified by a tag selected from the group consisting of those shown
in Tables 2, 4, and 5.

According to another aspect of the invention, a method of detecting 
colon cancer in a patient is provided. The method comprises the steps of:

comparing the level of at least one protein or transcript in a first 
body sample to a second body sample, wherein the first sample is a body 
sample of the patient and the second sample is of a normal human, wherein the
protein is encoded by a transcript and the transcript is identified by a tag 
selected from the group consisting of those shown in Table 2, wherein the first 
and second body sample is a sample selected from the group consisting of 
blood, urine, feces, sputum, and serum; 

identifying neoplasia when the level of the at least one protein 
or transcript is found to be higher in the first sample than in the second sample. 

In another aspect of the invention a method of detecting pancreatic 
cancer in a patient is provided. The method comprises the steps of: 

comparing the level of at least one protein or transcript encoded 
by a transcript in a first sample of a tissue to a second sample, wherein the first 
sample is of the patient and the second sample is of a normal human, wherein 
said protein is encoded by a transcript and the transcript is identified by a tag 
selected from the group consisting of those shown in Table 4, wherein the first
and second sample is a sample selected from the group consisting of blood, 
urine, feces, sputum, and serum; 

identifying neoplasia when the level of the at least one protein 
or transcript is found to be higher in the first sample than in the second sample. 

Also provided by the present invention is a method of detecting cancer 
in a patient. The method comprises the steps of:

comparing the level of at least one protein or transcript in a first
sample to a second sample, wherein the first sample is of the patient and the
second sample is of a normal human, wherein said protein is encoded by a
transcript and the transcript is identified by a tag selected from the group
consisting of those shown in Table 5, wherein the first and second body sample
is a sample selected from the group consisting of blood, urine, feces, sputum,
and serum; 

identifying neoplasia when the level of the at least one protein 
or transcript is found to be higher in the first sample than in the second sample. 

Additionally provided by the present invention is a method to aid in the 
determination of a prognosis for a colon cancer patient. The method comprises 
the steps of: 

comparing the level of at least one protein or transcript in a first 
sample to a second sample, wherein the first sample is of a colon cancer patient
and the second sample is of a normal human, wherein the protein is encoded
by a transcript and the transcript is identified by a tag selected from the group
consisting of those shown in Table 3, wherein the first and second body sample
is a sample selected from the group consisting of blood, urine, feces, sputum,
and serum; 

determining a poorer prognosis if the level of the at least one 
protein or transcript is found to be lower in the first sample than in the second 
sample. 

Provided by another embodiment of the invention is a method to aid 
in determining a prognosis for a patient with colon cancer. The method 
comprises the steps of: 

comparing the level of at least one protein or transcript in a first 
sample to a second sample, wherein the first sample is of a colonic cancer 
patient and the second sample is of a normal human, wherein the protein is 
encoded by a transcript and the transcript is identified by a tag selected from
the group consisting of those shown in Table 2, wherein the first and second
sample is a sample selected from the group consisting of blood, urine, feces,
sputum, and serum;

determining a poorer prognosis if the level of the at least one
protein or transcript is found to be higher in the first sample than in the second 
sample. 

According to still another aspect of the invention, a method to aid in 
determining a prognosis of a patient having pancreatic cancer is provided. The
method comprises the steps of:

comparing the level of at least one protein or transcript in a first
sample to a second sample, wherein the first sample is of a pancreatic cancer
patient and the second sample is of a normal human, wherein said protein is
encoded by a transcript and the transcript is identified by a tag selected from
the group consisting of those shown in Table 4, wherein said first and second
sample is a sample selected from the group consisting of blood, urine, feces,
sputum, and serum;

determining a poorer prognosis if the level of the at least one
protein or transcript is found to be higher in the first sample than in the second
sample. 

Also provided by the present invention is a method to aid in providing 
a prognosis for a cancer patient. The method comprises the steps of: 

comparing the level of expression of at least one protein or 
transcript in a first sample to a second sample, wherein the first sample is of a 
cancer patient and the second sample is of a normal human, wherein said 
protein is encoded by a transcript and the transcript is identified by a tag 
selected from the group consisting of those shown in Table 5, wherein the first
and second sample is a sample selected from the group consisting of blood,
urine, feces, sputum, and serum;

determining a poorer prognosis if the level of the at least one
protein or transcript is found to be higher in the first sample than in the second
sample. 

The present invention further includes antisense oligonucleotides
complementary in whole or in part to SEQ ID NOS: 1-732. 

This invention also provides a method for screening for candidate
agents that modulate the expression of a polynucleotide selected from the group
consisting of the polynucleotides in SEQ ID NOS: 1-732 or their respective
complements, by contacting a test agent with a pancreatic or colon cell and
monitoring expression of the polynucleotide, wherein the test agent which
modifies the expression of the polynucleotide is a candidate agent.

The present invention provides the art with new methods and reagents 
for diagnosing and prognosing cancers. In addition, some of the newly 
disclosed genes may play an important role in the development of cancers. 

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1. Comparison of expression patterns in colorectal cancers and normal 
colon epithelium. (FIG. 1A) A semi-logarithmic plot reveals 51 tags that
were decreased more than 10 fold in primary CR cancer cells whereas 32 tags
were increased more than 10 fold. 62,168 and 60,878 tags derived from
normal colon epithelium and primary CR cancers, respectively, were used for
this analysis. The relative expression of each transcript was determined by
dividing the number of tags observed in tumor and normal tissue as indicated.
To avoid division by 0, a tag value of 1 was used for any tag that was not
detectable in one of the samples. These ratios were then rounded to the
nearest integer and their distribution plotted on the abscissa. The number of
genes displaying each ratio was plotted on the ordinate. Tu: CR tumors; NC:
Normal colon. (FIG. 1B and FIG. 1C) Differentially expressed genes in
colorectal cancers. The number of transcripts found to be differentially
expressed (P < 0.01) is presented as Venn diagrams. Diagrams of transcripts
that were decreased (FIG. 1B) or increased (FIG. 1C) in CR cancers
compared to normal colon epithelium. Comparisons were between primary
tumors and cells in culture as indicated.
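
The ratio calculation described for FIG. 1A can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the signed convention (positive for tags increased in tumor, negative for tags decreased), the round-half-up tie rule, and the function names are choices made here, not details given in the figure legend, and the tag counts in the example are invented.

```python
from collections import Counter
from typing import Dict, Tuple


def fold_ratio(tumor: int, normal: int) -> int:
    """Integer fold change between tumor and normal tag counts.

    A count of 0 is replaced by 1 to avoid division by zero, as in the
    legend. Assumed convention: the larger count is divided by the smaller,
    and the result is negative when the tag is decreased in tumor.
    """
    t = max(tumor, 1)
    n = max(normal, 1)
    if t >= n:
        return int(t / n + 0.5)   # round half up (assumed tie rule)
    return -int(n / t + 0.5)


def ratio_distribution(counts: Dict[str, Tuple[int, int]]) -> Counter:
    """Map each rounded ratio to the number of tags displaying it.

    `counts` maps a tag to a (tumor_count, normal_count) pair.
    """
    return Counter(fold_ratio(t, n) for t, n in counts.values())


# Invented counts for three placeholder tags.
example = {"tag_a": (0, 15), "tag_b": (20, 0), "tag_c": (3, 3)}
print(ratio_distribution(example))
```

The raw counts are divided directly, as the legend describes; because the two libraries are of similar size (62,168 and 60,878 tags), no normalization step is included in this sketch.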

Fig. 2. Northern blot analysis of genes differentially expressed in 
gastrointestinal neoplasia. Northern blot analysis was performed on total RNA 
(5 ng) isolated from primary CR carcinomas (T) and matching normal colon
epithelium (N), or pancreatic carcinomas. The top panel in each case shows an
example of the ethidium bromide stained gels prior to transfer. The number of
SAGE tags observed in the original analysis is indicated to the right of each
blot. (FIG. 2A) Examples of transcripts that were decreased or increased in
CR cancers. (FIG. 2B) Examples of transcripts increased in pancreatic cancers
(JO). (FIG. 2C) Examples of transcripts elevated in cancer which were or
were not cancer type specific. Probes used for Northern blot analysis were as
follows (Human SAGE Tag unique identifier, gene name, (GenBank accession
number)): (FIG. 2A) H204104, Guanylin (M95714); H259108, (see Table 2);
H1000193, (see Table 2); H998030, (see Table 2). (FIG. 2B) H294155,
RIG-E (U42376); H560056, TIMP-1 (S68252). (FIG. 2C) H802810,
EST338411 (W52120); H85882, 1-8D (X57351); H618841, GA733-1
(X13425).

Tables 2-5. Transcripts Differentially Expressed in Human Cancer.

Tag sequence represents the NlaIII site plus the adjacent 11 bp SAGE tag.
Tag number indicates a SAGE UID (unique identifier). NC, TU, CL, PT, PC
refer to the number of the indicated tag observed in RNA isolated from
normal colorectal epithelium, primary colorectal cancers, colorectal cancer cell
lines, primary pancreatic cancers, or pancreatic cancer cell lines, respectively.
The Accession and Gene Name refer to representative GenBank entries that
contain the tag sequence.

Table 2 Transcripts increased in colorectal cancer. 

Table 3 Transcripts decreased in colorectal cancer. 

Table 4 Transcripts increased in pancreatic cancer. 

Table 5 Transcripts increased in pancreatic and colorectal cancer. 

DETAILED DESCRIPTION

The inventors have discovered sets of human genes which are either 
upregulated or downregulated in cancer cells, as compared to normal cells. 
Specifically, certain genes have been found to be upregulated or downregulated
in colorectal and/or pancreatic cancer cells, when compared to normal colon
cells. These sets of differentially regulated genes can be used as diagnostic
markers, either individually or in sets of, for example, 2, 5, 10, 20, or 30. 

Genes whose expression was detected to be increased in colorectal 
cancer are shown in Table 2. Genes whose expression was detected to be 
decreased in colorectal cancer are shown in Table 3. Genes whose expression
was detected as increased in pancreatic cancer are shown in Table 4. Genes
whose expression was detected as increased in both pancreatic cancer and 
colorectal cancer are shown in Table 5. These latter genes likely play a role in 
neoplastic development generally. 

Tag sequences, as provided herein, uniquely identify genes. This is due
to their length, and their specific location (3') in a gene from which they are
drawn. The full length genes can be identified by matching the tag to a gene
data base member, or by using the tag sequences as probes to physically isolate
previously unidentified genes from cDNA libraries. The methods by which
genes are isolated from libraries using DNA probes are well known in the art.
See, for example, Velculescu et al., Science 270: 484 (1995), and Sambrook et
al. (1989), MOLECULAR CLONING: A LABORATORY MANUAL, 2nd
ed. (Cold Spring Harbor Press, Cold Spring Harbor, New York). Once a gene
or transcript has been identified, either by matching to a data base entry, or by
physically hybridizing to a cDNA molecule, the position of the hybridizing or
matching region in the transcript can be determined. If the tag sequence is not
in the 3' end, immediately adjacent to the restriction enzyme used to generate
the SAGE tags, then a spurious match may have been made. Confirmation of
the identity of a SAGE tag can be made by comparing transcription levels of
the tag to that of the identified gene in certain cell types.
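
The positional check described above can be illustrated with a short Python sketch. It assumes, following the description of Tables 2-5, that a tag consists of the NlaIII site (CATG) plus the adjacent 11 bp, and that the relevant site is the 3'-most one in the transcript. The function names and the toy transcript sequence are hypothetical and serve only to show the logic; the sketch does not reproduce any database search.

```python
from typing import Optional

ANCHOR = "CATG"   # NlaIII recognition site
TAG_LEN = 11      # bases of SAGE tag adjacent to the site


def three_prime_tag(transcript: str) -> Optional[str]:
    """Return the NlaIII site plus the adjacent 11 bp at the 3'-most CATG,
    or None if there is no site or fewer than 11 bases follow it."""
    seq = transcript.upper()
    pos = seq.rfind(ANCHOR)
    if pos == -1:
        return None
    tag = seq[pos:pos + len(ANCHOR) + TAG_LEN]
    return tag if len(tag) == len(ANCHOR) + TAG_LEN else None


def is_expected_position(transcript: str, tag: str) -> bool:
    """True if `tag` sits at the 3'-most NlaIII site of `transcript`;
    False suggests the match may be spurious."""
    return three_prime_tag(transcript) == tag.upper()


# Toy transcript (invented) with two CATG sites.
toy = "AAACATGTTTTTTTTTTTGGGCATGACGTACGTACGTAAAA"
print(three_prime_tag(toy))                          # tag at the 3'-most site
print(is_expected_position(toy, "CATGTTTTTTTTTTT"))  # upstream site only
```

A tag that occurs in the transcript only at an upstream site, as in the second call, fails the check and would be treated as a possible spurious match.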

In addition to the sequences shown in SEQ ID NOS: 1-732, or their 
complements, this invention also provides the anti-sense polynucleotide strand,
e.g. antisense RNA to these sequences or their complements. One can obtain
an antisense RNA using the sequences provided in SEQ ID NOS: 1-732 and
the methodology described in van der Krol et al. (1988) BioTechniques 6:958.
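
For orientation only, the sequence relationship between a DNA tag and an antisense RNA can be sketched in Python. This shows base complementarity, not the laboratory methodology cited above. The helper names are assumptions, and the input sequence is an arbitrary example rather than one of SEQ ID NOS: 1-732.

```python
# Translation table mapping each DNA base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(dna: str) -> str:
    """Return the antisense RNA (5'->3') for a sense-strand DNA sequence."""
    return reverse_complement(dna).upper().replace("T", "U")


print(reverse_complement("CATGACGT"))
print(antisense_rna("CATGACGT"))
```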

The invention also encompasses polynucleotides which differ from
the polynucleotides described above, but which produce the same
phenotypic effect, such as the allele. These altered, but phenotypically
equivalent polynucleotides are referred to as "equivalent nucleic acids." This
invention also encompasses polynucleotides characterized by changes in
non-coding regions that do not alter the phenotype of the polypeptide
produced therefrom when compared to the polynucleotide herein. This
invention further encompasses polynucleotides, which hybridize to the
polynucleotides of the subject invention under conditions of moderate or high
stringency.

The polynucleotides can be conjugated to a detectable marker, e.g., an 
enzymatic label or a radioisotope for detection of nucleic acid and/or 
expression of the gene in a cell. A wide variety of appropriate detectable 
markers are known in the art, including fluorescent, radioactive, enzymatic or
other ligands, such as avidin/biotin, which are capable of giving a detectable
signal. In preferred embodiments, one will likely desire to employ a fluorescent
label or an enzyme tag, such as urease, alkaline phosphatase or peroxidase,
instead of radioactive or other environmentally undesirable reagents. In the case
of enzyme tags, colorimetric indicator substrates are known which can be
employed to provide a means visible to the human eye or
spectrophotometrically, to identify specific hybridization with complementary
nucleic acid-containing samples. Briefly, this invention further provides a
method for detecting a single-stranded polynucleotide identified by SEQ ID
NOS: 1-732 or its complement, by contacting target single-stranded
polynucleotides with a labeled, single-stranded polynucleotide (a probe) which
is at least 10 nucleotides of the complement of SEQ ID NOS: 1-732 (or the
corresponding complement) under conditions permitting hybridization
(preferably moderately stringent hybridization conditions) of complementary
single-stranded polynucleotides, or more preferably, under highly stringent
hybridization conditions. Hybridized polynucleotide pairs are separated from
un-hybridized, single-stranded polynucleotides. The hybridized polynucleotide
pairs are detected using methods well known to those of skill in the art and set
forth, for example, in Sambrook et al. (1989) supra. 

The polynucleotides of this invention can be isolated using the 
technique described in the experimental section or replicated using PCR. The
PCR technology is the subject matter of United States Patent Nos. 4,683,195,
4,800,159, 4,754,065, and 4,683,202 and described in PCR: The Polymerase
Chain Reaction (Mullis et al. eds, Birkhauser Press, Boston (1994)) or
MacPherson et al. (1991) and (1994), supra, and references cited therein.
Alternatively, one of skill in the art can use the sequences provided herein and
a commercial DNA synthesizer to replicate the DNA. Accordingly, this
invention also provides a process for obtaining the polynucleotides of this
invention by providing the linear sequence of the polynucleotide, nucleotides,
appropriate primer molecules, chemicals such as enzymes and instructions for
their replication and chemically replicating or linking the nucleotides in the
proper orientation to obtain the polynucleotides. In a separate embodiment,
these polynucleotides are further isolated. Still further, one of skill in the art
can insert the polynucleotide into a suitable replication vector and insert the
vector into a suitable host cell (procaryotic or eucaryotic) for replication and
amplification. The DNA so amplified can be isolated from the cell by methods
well known to those of skill in the art. A process for obtaining polynucleotides
by this method is further provided herein as well as the polynucleotides so
obtained.

RNA can be obtained by first inserting a DNA polynucleotide into a 
suitable host cell. The DNA can be inserted by any appropriate method, e.g., 

by the use of an appropriate gene delivery vector or by electroporation. When
the cell replicates and the DNA is transcribed into RNA, the RNA can then be
isolated using methods well known to those of skill in the art, for example, as
set forth in Sambrook et al. (1989) supra. For instance, mRNA can be isolated
using various lytic enzymes or chemical solutions according to the procedures
set forth in Sambrook et al. (1989), supra or extracted by nucleic-acid-binding
resins following the accompanying instructions provided by manufacturers.

Polynucleotides having at least 10 nucleotides and exhibiting sequence
complementarity or homology to SEQ ID NOS: 1-732 find utility as
hybridization probes. In some aspects, the full coding sequence of the
transcript, i.e., for SEQ ID NOS: 1-732, is known. Accordingly, any portion
of the known sequences available in GenBank, or homologous sequences, can 
be used in the methods of this invention. 

It is known in the art that a "perfectly matched" probe is not needed for 
a specific hybridization. Minor changes in probe sequence achieved by 
substitution, deletion or insertion of a small number of bases do not affect the 
hybridization specificity. In general, as much as 20% base-pair mismatch 
(when optimally aligned) can be tolerated. Preferably, a probe useful for
detecting the aforementioned mRNA is at least about 80% identical to the
homologous region of comparable size contained in the previously identified
sequences identified by SEQ ID NOS: 1-732, which correspond to previously
characterized genes or SEQ ID NOS: 1-732, which correspond to known
ESTs. More preferably, the probe is 85% identical to the corresponding gene
sequence after alignment of the homologous region; even more preferably, it
exhibits 90% identity.
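
The identity thresholds above can be illustrated with a small Python sketch. It assumes the probe and the homologous region have already been optimally aligned to equal length without gaps, which is a simplification; the function names, the tier labels, and the example sequences are hypothetical.

```python
def percent_identity(probe: str, region: str) -> float:
    """Percent of positions that match between two pre-aligned,
    equal-length sequences (gapless alignment assumed)."""
    if not probe or len(probe) != len(region):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(probe.upper(), region.upper()))
    return 100.0 * matches / len(probe)


def identity_tier(identity: float) -> str:
    """Classify an identity value against the 80%, 85% and 90% levels."""
    if identity >= 90.0:
        return "even more preferred (>= 90%)"
    if identity >= 85.0:
        return "more preferred (>= 85%)"
    if identity >= 80.0:
        return "preferred (>= 80%)"
    return "below 80%"


# Invented 10-base sequences with one and two mismatches respectively.
print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
print(identity_tier(percent_identity("ACGTACGTAC", "ACGTACGTTT")))
```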

These probes can be used in radioassays (e.g. Southern and Northern
blot analysis) to detect, prognose, diagnose or monitor various pancreatic or
colon cells or tissue containing these cells. The probes also can be attached to
a solid support or an array such as a chip for use in high throughput screening
assays for the detection of expression of the gene corresponding to one or
more polynucleotide(s) of this invention. Accordingly, this invention also
provides at least one of the transcripts identified as SEQ ID NOS: 1-732, or its
complement, attached to a solid support for use in high throughput screens.

The total size of the fragment, as well as the size of the complementary
stretches, will depend on the intended use or application of the particular
nucleic acid segment. Smaller fragments will generally find use in hybridization
embodiments, wherein the length of the complementary region may be varied,
such as between about 10 and about 100 nucleotides, or even full length
according to the complementary sequences one wishes to detect.

Nucleotide probes having complementary sequences over stretches
greater than 10 nucleotides in length are generally preferred, so as to increase
the stability and selectivity of the hybrid and thereby improve the specificity
of particular hybrid molecules obtained. More preferably, one can design
polynucleotides having gene-complementary stretches of more than 50
nucleotides in length, or even longer where desired. Such fragments may be
readily prepared by, for example, directly synthesizing the fragment by
chemical means, by application of nucleic acid reproduction technology, such
as the PCR technology with two priming oligonucleotides as described in U.S.
Pat. No. 4,603,102, or by introducing selected sequences into recombinant
vectors for recombinant production. A preferred probe is about 50-75 or, more
preferably, 50-100 nucleotides in length.
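
As a hedged illustration of the probe lengths discussed above, the following
Python sketch enumerates candidate antisense probes of a chosen length from a
known transcript sequence. The window length of 50 and the step of 10 are
illustrative defaults only, and the names reverse_complement and
candidate_probes are hypothetical.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def candidate_probes(transcript: str, length: int = 50, step: int = 10) -> list:
    """List antisense probes of `length` nt tiled along `transcript`.

    Assumption: `transcript` is the sense strand, so each probe is the
    reverse complement of one window. Lengths under 10 nt are rejected
    because the text treats 10 nt as the minimum useful probe size.
    """
    if length < 10:
        raise ValueError("probe length should be at least 10 nucleotides")
    if step < 1:
        raise ValueError("step must be a positive integer")
    last_start = len(transcript) - length
    return [
        reverse_complement(transcript[i : i + length])
        for i in range(0, last_start + 1, step)
    ]
```

Each returned probe is complementary to one window of the transcript;
selecting among the candidates (for example by base composition) is left to
the practitioner.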

The polynucleotides of the present invention can serve as primers for
the detection of genes or gene transcripts that are expressed in pancreatic or
colon cells. In this context, amplification means any method employing a
primer-dependent polymerase capable of replicating a target sequence with
reasonable fidelity. Amplification may be carried out by natural or recombinant
DNA polymerases such as T7 DNA polymerase, Klenow fragment of E. coli
DNA polymerase, and reverse transcriptase.

A preferred amplification method is PCR. However, PCR conditions
used for each reaction are empirically determined. A number of parameters
influence the success of a reaction. Among them are annealing temperature
and time, extension time, Mg2+ and ATP concentration, pH, and the relative
concentration of primers, templates, and deoxyribonucleotides. After
amplification, the resulting DNA fragments can be detected by agarose gel
electrophoresis followed by visualization with ethidium bromide staining and
ultraviolet illumination.
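
Because PCR conditions are determined empirically, a calculation can only
suggest a starting point. As one hedged illustration, the widely used Wallace
rule of thumb for short oligonucleotides (about 2 degrees C per A or T plus
4 degrees C per G or C) may be used to choose an initial annealing
temperature. The rule is not taken from this disclosure, and the 5 degree
offset and the function names below are assumptions.

```python
def wallace_tm(primer: str) -> int:
    """Estimate melting temperature (deg C) by the Wallace rule.

    Tm = 2 * (A + T) + 4 * (G + C); a rough guide for short primers only.
    """
    p = primer.upper()
    at = p.count("A") + p.count("T")
    gc = p.count("G") + p.count("C")
    if not p or at + gc != len(p):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    return 2 * at + 4 * gc


def starting_annealing_temp(forward: str, reverse: str, offset: int = 5) -> int:
    """Suggest a first annealing temperature to try for a primer pair.

    Assumption: start `offset` degrees below the lower of the two Tm
    estimates, then adjust empirically.
    """
    return min(wallace_tm(forward), wallace_tm(reverse)) - offset
```

The value returned is only a first guess; the remaining parameters listed
above still have to be optimized at the bench.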

The invention further provides the isolated polynucleotide operatively
linked to a promoter of RNA transcription, as well as other regulatory
sequences for replication and/or transient or stable expression of the DNA or
RNA. As used herein, the term "operatively linked" means positioned in such
a manner that the promoter will direct transcription of RNA off the DNA
molecule. Examples of such promoters are SP6, T4 and T7. In certain
embodiments, cell-specific promoters are used for cell-specific expression of
the inserted polynucleotide. Vectors which contain a promoter or a
promoter/enhancer, with termination codons and selectable marker sequences,
as well as a cloning site into which an inserted piece of DNA can be operatively
linked to that promoter are well known in the art and commercially available.

For general methodology and cloning strategies, see Gene Expression
Technology (Goeddel ed., Academic Press, Inc. (1991)) and references cited
therein and Vectors: Essential Data Series (Gacesa and Ramji, eds., John Wiley
& Sons, N.Y. (1994)), which contains maps, functional properties, commercial
suppliers and a reference to GenEMBL accession numbers for various suitable
vectors. Preferably, these vectors are capable of transcribing RNA in vitro or
in vivo.

Fragments of the sequences shown in SEQ ID NOS: 1-732 or their
respective complements also are encompassed by this invention, preferably
having at least 10 nucleotides and more preferably having at least 18
nucleotides. Larger polynucleotides, e.g., cDNA or genomic DNA, which
hybridize under moderate or stringent conditions to the polynucleotide
sequences shown in SEQ ID NOS: 1-732, or their respective complements, also
are encompassed by this invention.

In one embodiment, these fragments are polynucleotides that encode
polypeptides or proteins having diagnostic and therapeutic utilities as described
herein as well as probes to identify transcripts of the protein which may or may
not be present. These nucleic acid fragments can be prepared, for example, by
restriction enzyme digestion of the polynucleotide of SEQ ID NOS: 1-732, or
their complements, and then labeled with a detectable marker. Alternatively,
random fragments can be generated using nick translation of the molecule. For
methodology for the preparation and labeling of such fragments, see Sambrook
et al. (1989) supra.

Expression vectors containing these nucleic acids are useful to obtain 
host vector systems to produce proteins and polypeptides. It is implied that 
these expression vectors must be replicable in the host organisms either as 
episomes or as an integral part of the chromosomal DNA. Suitable expression 
vectors include viral vectors, including adenoviruses, adeno-associated viruses, 
retroviruses, cosmids, etc. Adenoviral vectors are particularly useful for 
introducing genes into tissues in vivo because of their high levels of expression 
and efficient transformation of cells both in vitro and in vivo. When a nucleic
acid is inserted into a suitable host cell, e.g., a procaryotic or a eucaryotic
cell, and the host cell replicates, the protein can be recombinantly produced.
Suitable host cells will depend on the vector and can include mammalian cells,
animal cells, human cells, simian cells, insect cells, yeast cells, and bacterial
cells constructed using well known methods. See Sambrook et al. (1989)
supra. In addition to the use of viral vectors for insertion of exogenous nucleic
acid into cells, the nucleic acid can be inserted into the host cell by methods
well known in the art such as transformation for bacterial cells; transfection
using calcium phosphate precipitation for mammalian cells; or DEAE-dextran;
electroporation; or microinjection. See Sambrook et al. (1989) supra for this 
methodology. Thus, this invention also provides a host cell, e.g. a mammalian 
cell, an animal cell (rat or mouse), a human cell, or a procaryotic cell such as 
a bacterial cell, containing a polynucleotide encoding a protein or polypeptide 
or antibody. 

When the vectors are used for gene therapy in vivo or ex vivo, a
pharmaceutically acceptable vector is preferred, such as a
replication-incompetent retroviral or adenoviral vector. Pharmaceutically
acceptable vectors containing the nucleic acids of this invention can be further
modified for transient or stable expression of the inserted polynucleotide. As
used herein, the term "pharmaceutically acceptable vector" includes, but is not
limited to, a vector or delivery vehicle having the ability to selectively target
and introduce the nucleic acid into dividing cells. An example of such a vector
is a "replication-incompetent" vector defined by its inability to produce viral
proteins, precluding spread of the vector in the infected host cell. An example
of a replication-incompetent retroviral vector is LNL6 (Miller, A.D. et al.
(1989) BioTechniques 7:980-990). The methodology of using
replication-incompetent retroviruses for retroviral-mediated gene transfer of
gene markers is well established (Correll et al. (1989) PNAS USA 86:8912;
Bordignon (1989) PNAS USA 86:8912-52; Culver, K. (1991) PNAS USA
88:3155; and Rill, D.K. (1991) Blood 79(10):2694-700). Clinical investigations
have shown that there are few or no adverse effects associated with the viral
vectors; see Anderson (1992) Science 256:808-13.

Compositions containing the polynucleotides of this invention, in
isolated form or contained within a vector or host cell, are further provided
herein. When these compositions are to be used pharmaceutically, they are
combined with a pharmaceutically acceptable carrier.

This invention further encompasses genes, either genomic or cDNA,
which code for a polypeptide or protein in the cell of interest. The genes
specifically hybridize under moderate or stringent conditions to a
polynucleotide identified by SEQ ID NOS: 1-732 or their respective
complements. The process of identification of a larger fragment or the
full-length coding sequence to which the partial sequence depicted in SEQ ID
NOS: 1-732 hybridizes preferably involves the use of the methods and reagents
provided in this invention, either singularly or in combination.

Five methods are disclosed herein which allow one of skill in the art
to isolate the gene or cDNA corresponding to the transcripts of the invention.

RACE-PCR Technique 

One method to isolate the gene or cDNA which codes for a polypeptide
or protein and which corresponds to a transcript of this invention involves the
5'-RACE-PCR technique. In this technique, the poly-A mRNA that contains
the coding sequence of particular interest is first identified by hybridization to
a sequence disclosed herein and then reverse transcribed with a 3'-primer
comprising the sequence disclosed herein. The newly synthesized cDNA strand
is then tagged with an anchor primer of a known sequence, which preferably
contains a convenient cloning restriction site attached at the 5' end. The tagged
cDNA is then amplified with the 3'-primer (or a nested primer sharing sequence
homology to the internal sequences of the coding region) and the 5'-anchor
primer. The amplification may be conducted under conditions of various levels
of stringency to optimize the amplification specificity. 5'-RACE-PCR can be
readily performed using commercial kits (available from, e.g., BRL Life
Technologies Inc., Clontech) according to the manufacturer's instructions.

Identification of known genes or ESTs

In addition, databases exist that reduce the complexity of ESTs by
assembling contiguous EST sequences into tentative genes. For example,
TIGR has assembled human ESTs into a database called THC for tentative
human consensus sequences. The THC database allows for a more definitive
assignment compared to ESTs alone. Software programs exist (give examples)
that allow for assembling ESTs into contiguous sequences from any organism.
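
The text does not name particular assembly programs. The following toy
sketch in Python only illustrates the underlying idea of joining two
overlapping ESTs into one contiguous sequence by an exact suffix-prefix
overlap. Real assemblers tolerate sequencing errors and consider both
strands; the function names and the minimum-overlap parameter are
assumptions made for illustration.

```python
from typing import Optional


def overlap_length(left: str, right: str, min_overlap: int = 20) -> int:
    """Length of the longest suffix of `left` that is a prefix of `right`.

    Returns 0 when no exact overlap of at least `min_overlap` bases exists.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest_possible = min(len(left), len(right))
    for k in range(longest_possible, min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return k
    return 0


def merge_pair(left: str, right: str, min_overlap: int = 20) -> Optional[str]:
    """Merge two ESTs into one contig, or return None if they do not overlap."""
    k = overlap_length(left, right, min_overlap)
    if k == 0:
        return None
    return left + right[k:]
```

Repeating such pairwise merges over a collection of ESTs yields tentative
consensus sequences of the kind held in the THC database, although the
actual construction of that database is not described here.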

Isolation of cDNAs from a library by probing with the SAGE transcript or tag

Alternatively, mRNA from a sample preparation was used to construct a
cDNA library in the ZAP Express vector following the procedure described in
Velculescu et al. (1997) Science 270:484. The ZAP Express cDNA synthesis
kit (Stratagene) was used according to the manufacturer's protocol. Plates
containing 250 to 2000 plaques are hybridized as described in Rupert et al.
(1988) Mol. Cell. Bio. 8:3104 to oligonucleotide probes with the same
conditions previously described for standard probes except that the
hybridization temperature is reduced to room temperature. Washes are
performed in 6X standard-saline-citrate 0.1% SDS for 30 minutes at room
temperature. The probes are labeled with 32P-ATP through use of T4
polynucleotide kinase.
Human alpha-actinin.
609F Homo sapiens cDNA clone 609 similar to SET protein
HHEA18W H. sapiens partial cDNA sequence; clone HEA18W;
zq73e07.r1 Stratagene neuroepithelium (#937231) Homo sapiens cDNA clone 647268 5' similar to TR:E16910 E16910 ENDONUCLEASE.
zl83f08.s1 Stratagene colon (#937204) Homo sapiens cDNA clone 511239 3'
yj23g11.r1 Homo sapiens cDNA clone 149636 5'.
Human profilin mRNA, complete cds.
Human smooth muscle myosin alkali light chain mRNA
Human M4-50 mRNA for HLA class I antigen.
H.sapiens mitochondrial EST sequence (001T24) from
zl74e07.s1 Stratagene colon (#937204) Homo sapiens cDNA clone 510372 3' similar to contains Alu repetitive element
Unknown
yn01b01.r1 Homo sapiens cDNA clone 167113 5' similar to SP:ZK783.1 CE00760
EST277 Homo sapiens cDNA clone I0H4.
H.sapiens mRNA for non-muscle type cofilin.
H.sapiens p27 mRNA.
H.sapiens mitochondrial EST sequence (009T28) from
za16a03.s1 Homo sapiens cDNA clone 292684 3' similar to contains Alu repetitive element; contains element L1 repetitive element
ze30b10.s1 Soares retina N2b4HR Homo sapiens cDNA clone 360475 3' similar to contains Alu repetitive element
yl14h01.s1 Homo sapiens cDNA clone 158257 3' similar to contains Alu repetitive element; contains TAR1 repetitive element
zr79h11.s1 Soares NhHMPu S1 Homo sapiens cDNA clone 05 1 y 5 / J similar to WP:C33A12.7 CE05353
zc39c11.s1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 324716 3'
zb90f03.s1 Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 310877 3'
yf13h12.s1 Homo sapiens cDNA clone 126791 3'.
Homo sapiens colon mucosa-associated (DRA) mRNA
Human clone HP-DAO1 diamine oxidase
zb68b06.s1 Homo sapiens cDNA clone 308723 3'.
NIB1986 Normalized infant brain, Bento Soares Homo sapiens cDNA 3' end.
yu22h07.s1 Homo sapiens cDNA clone 234589 3' similar to SP:SBP_MOUSE P17563 SELENIUM-BINDING
EST47523 Homo sapiens cDNA 3' end similar to similar to Selenium-binding protein, liver.
Human messenger RNA for alpha globin.
Unknown
Human jun-D mRNA for JUN-D protein.
yh83f04.r1 Homo sapiens cDNA clone 136351 5'.
yj44e07.s1 Homo sapiens cDNA clone 151020 3'.
yh83f04.s1 Homo sapiens cDNA clone 136351 3'.
zl71c06.r1 Stratagene colon (#937204) Homo sapiens cDNA clone 510082 5'
H.sapiens mitochondrial EST sequence (007T13) from
H.sapiens mRNA for E-cadherin.
Human mRNA for pancreatic trypsinogen III.
yl26g01.s1 Homo sapiens cDNA clone 159410 3'.
Human brain-type clathrin light-chain b mRNA.
yy92c07.s1 Homo sapiens cDNA clone 281004 3' similar to contains Alu repetitive element; contains element MER32 repetitive element
Human A33 antigen precursor mRNA, complete cds
yt85h08.s1 Homo sapiens cDNA clone 231135 3'.
ya05b02.s1 Homo sapiens cDNA clone 60555 3'.
Human CO-029.
ym89c10.s1 Homo sapiens cDNA clone 166098 3'.
H.sapiens Irp mRNA.
Unknown
Human carbonic anhydrase II mRNA, complete cds.
Unknown
H.sapiens zinc finger transcriptional regulator mRNA
Human ERK1 mRNA for protein serine/threonine kinase
Human cellular oncogene c-fos (complete sequence).
Human uroguanylin mRNA, complete cds.
zs57c03.s1 Soares NbHTGBC Homo sapiens cDNA clone 701572 3'
yb47a01.s1 Homo sapiens cDNA clone f^iw j containing L1 repetitive element
yf36e10.s1 Homo sapiens cDNA clone iouyj similar to goiAu/ 1 /j INTER-ALPHA-TRYPSIN INHIBITOR COMPLEX COMPONENT II
zu65c08.s1 Soares testis NHT Homo sapiens cDNA clone 742862 3'
Unknown
yl07b10.r1 Homo sapiens cDNA clone 147547 5'.
EST47211 Homo sapiens cDNA 3' end similar to None.
zd17g02.s1 Soares fetal heart NbHH19W Homo sapiens cDNA clone 340946 3'
zt47c12.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone 725518 3'
H.sapiens HE4 mRNA for extracellular proteinase inhibitor homologue
Unknown
yg52g07.s1 Homo sapiens cDNA clone 36232 3' similar to gb:M33987 CARBONIC ANHYDRASE I
vv t OaOf, <: 1 Homo sapiens cDNA clone 261490 3'.
zo97h01.s1 Stratagene ovarian cancer (#937219) Homo sapiens cDNA clone 594865 3'
yx15g08.s1 Homo sapiens cDNA clone 261854 3'.
H.sapiens mRNA for calcium-binding protein S100P.
emb|Z69881|HSSERCA3M H.sapiens mRNA for adenosine triphosphatase, calcium
ye65c02.r1 Homo sapiens cDNA clone 122594 5'.
gb|AA347726|AA347726 EST54132 Fetal heart II Homo sapiens cDNA 5' end similar to transmembrane secretory component
Homo sapiens bone-derived growth factor (BPGF-1) m
H.sapiens CL 100 mRNA for protein tyrosine phosphatase
Human N-benzoyl-L-tyrosyl-p-amino-benzoic acid hydrolase alpha subunit (PPH alpha) mRNA, complete cds
Human mRNA for transmembrane carcinoembryonic antigen (CEA)
yo45d01.s1 Homo sapiens cDNA clone 180865 3' similar to contains PTR5 repetitive element
yo36g07.s1 Homo sapiens cDNA clone 180060 3' similar to contains PTR5 repetitive element
Human I-plastin mRNA, complete cds.
HSB4BMR H.sapiens mRNA for B4B
Human epithelial membrane protein (CL-20) mRNA, complete cds
HSPAPR H.sapiens mRNA for Progression Associated Protein
yj64g10.r1 Homo sapiens cDNA clone 153570 5'.
EST10a24 Clontech adult human fat cell library HL1108A Homo sapiens cDNA clone 10a24.
yd84b04.s1 Homo sapiens cDNA clone 114895 3'.
zo15g05.s1 Stratagene colon (#937204) Homo sapiens cDNA clone 587000 3'
yj58g11.s1 Homo sapiens cDNA clone 152996 3'.
ya84h01.s1 Homo sapiens cDNA clone 68401 3'.
gb|R73013|R73013 yj94a09.r1 Homo sapiens cDNA clone 156376 5'.
Human fetal brain cDNA 3'-end GEN-007C04.
zb91h11.s1 Soares parathyroid tumor NbHPA Homo sap
H.sapiens mitochondrial EST sequence (132-20) from skeletal muscle
EST186995 HCC cell line (metastasis to liver in mouse) II Homo sapiens cDNA 5' end
H. sapiens partial cDNA sequence; clone A6A03 ; ver
yw53h01.s1 Homo sapiens cDNA clone 255985 3'.
Human MHC class I HLA-A2 gene, complete cds.
yf25f12.s1 Homo sapiens cDNA clone 127919 3'.
yl22c10.s1 Homo sapiens cDNA clone 158994 3'.
EST58371 Homo sapiens cDNA 3' end similar to None.
H.sapiens mitochondrial EST sequence (129-09)
zt54f10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone 726187 3'
zt31c11.r1 Soares ovary tumor NbHOT Homo sapiens cDNA clone 723956 5' similar to TR:G205858 G205858 RAT ORF
zb62d07.s1 Soares fetal lung NbHL19W Homo sapiens cDNA clone 308173 3' similar to PIR:A39484 A39484 androgen-withdrawal apoptosis protein RVP1, prostatic - rat
zb19c06.s1 Homo sapiens cDNA clone 302506 3' similar to PIR:A39484 A39484 androgen-withdrawal apoptosis protein RVP1, prostatic - rat
zk39d06.s1 Soares pregnant uterus NbHPU Homo sapiens cDNA clone 485195 3' similar to PIR:A39484 A39484 androgen-withdrawal apoptosis protein RVP1
Human partial cDNA sequence with CtA repeat region
Human episialin variant A mRNA, 3' end.
Unknown
scq8l6 Homo sapiens cDNA clone b4HB3MA-COT8-HAP-Ft
H.sapiens mRNA for LI-cadherin.
dbj|C00470|C00470 HUMGS0007620, Human Gene Signature, 3'-directed cDNA sequence.
yy62g08.s1 Homo sapiens cDNA clone ^/iH /4 J .
H3 87596
CATGCGCTTTAGGGA
CATGAI n ICTAAAA
CATGCACTTGCCCT
CATGCCTGCTGCAGG
CATGAGAACCTTCCA
CATGCTCTGCCCTC
CATGGCCATCCCCTT
CATGAGGATGTGGG
zo80f04.s1 Stratagene ovarian cancer (#937219) Homo sapiens cDNA clone 593215 3'
zv40a02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone 756074 3'
zl92g08.s1 Stratagene colon (#937204) Homo sapiens cDNA clone 512126 3'
zt56b12.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone 726335 3'
yj73h02.r1 Homo sapiens cDNA clone 154419 5' simil
Human HL60 3'-directed MboI cDNA, HUMGS01086, clone
Unknown
Human TSC-22 protein mRNA, complete cds.
yj05g03.r1 Homo sapiens cDNA clone 147892 5'.
Human fetal brain cDNA 3'-end GEN-007D07.
Human DNA for putative protein kinase.
Human alpha-1-antitrypsin mRNA, complete cds.
yi81g01.r1 Homo sapiens cDNA clone 145680 5'.
Human intestinal mucin mRNA, partial cds, clone bM
EST082 Homo sapiens cDNA clone 3E6.
za63a11.s1 Homo sapiens cDNA clone 297212 3' similar to PIR:S49589 S49589 cortical granule lectin - African clawed frog
EST30893 Homo sapiens cDNA 5' end similar to None.
yq92e02.s1 Homo sapiens cDNA clone 203258 3' simil
Human RASF-A PLA2 mRNA, complete cds.
EST86866 HSC172 cells I Homo sapiens cDNA 5' end
zn93g08.r1 Stratagene lung carcinoma 937218 Homo sapiens cDNA clone 565790 5'
zk10f05.s1 Soares pregnant uterus NbHPU Homo sapiens cDNA clone 470145 3'
Human colon 3'-directed MboI cDNA, HUMGS04047, clon
zr72g02.s1 Soares NhHMPu S1 Homo sapiens cDNA clone 668978 3'
yl75f07.s1 Homo sapiens cDNA clone 43778 3'.
Unknown
EST112734 Colon I Homo sapiens cDNA 5' end
CATGGGTCCTCTCTT


CATGATGGGCTTGAT 


CATGCTGCCCCCCAT 


CATGTTCTCTACACA 


CATGAAGAAGCAGGG 


CATGAGTAGGTGGCC 


CATGACAGTGTGTGT 


CATGGGAAAAGTGGT 
CATGACACCCATCAC
Human mRNA for vascular anticoagulant
Human placental anticoagulant protein (PAP) mRNA
Human lipocortin-V mRNA, complete cds
Human endonexin II mRNA, complete cds
GAMMA-INTERFERON-INDUCIBLE PROTEIN IP-30 PRECURSOR (HUMAN)
EST97384 Thymus II Homo sapiens cDNA 3' end similar to interferon, gamma transducer 1
Human ribosomal protein L9 mRNA
Human ribosomal protein L9 mRNA, complete cds
Human mRNA for human homologue of rat ribosomal protein
zm03a05.s1 Stratagene corneal stroma (#937222) Homo sapiens cDNA clone 513008 3'
RNA polymerase II transcription factor SIII p15 subunit mRNA
H.sapiens CpG DNA, clone 13a10, reverse read cpg1
H.sapiens mRNA for mitochondrial dodecenoyl-CoA dehydrogenase
40S RIBOSOMAL PROTEIN S3A (HUMAN)
Human insulin-like growth factor binding protein 4
Human insulin-like growth factor binding protein-4 (IGFBP4) gene, promoter and complete cds
H.sapiens mts1 gene.
Human CAPL protein mRNA, complete cds
yx70b09.s1 Homo sapiens cDNA clone 267065 3' similar to gb:L12350 THROMBOSPONDIN 2 PRECURSOR (HUMAN)
zl25e11.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone 714188 3' similar to gb:M33680 CD81 ANTIGEN (HUMAN)
CD81 antigen
Neurosin
protease M


X12454
M18366
M21731
J03745
J03909
U09953
U21138
D14531
AA063259
L42856
Z59242
Z25820
L24774
M84711
M62403
U20982
Z33457
M80563
N23207
AA285023
M33680
D78203
U62801
H213518




H2 136791 






H2 13751 


H219750 


H229502I 




H235531 




H243676 


H243710 




H244487I 




H270083 


H286424 




H291889 












CATGATCAAGAATCC 




CATGATCAAGGGTGT 






CATGATCAAGTTCGA


CATGATCCGGCGCCA 


CATGATGAAACTTCG 





CATGATGCGAAAGGC 




CATGATGTCTTCGTT 


CATGATGTCTTTTCT 




CATGATGTGTAACGA 




CATGCAACTTAAAGC 


CATGCACCTGTCCTT 
Isolation of partial cDNA (3' fragment) by 3' directed PCR reaction

This procedure is a modification of the protocol described in Polyak et
al. (1997) Nature 389:300. Briefly, the procedure uses SAGE tags in a PCR
reaction such that the resultant PCR product contains the SAGE tag of interest
as well as additional cDNA, the length of which is defined by the position of
the tag with respect to the 3' end of the cDNA. The cDNA product derived
from such a transcript driven PCR reaction can be used for many applications.
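
To make the relationship between tag position and product length concrete,
the following Python sketch locates a SAGE tag within a known cDNA sequence
and returns the stretch from the tag to the 3' end. It is an illustration
under stated assumptions, not the procedure itself: the cDNA is given as the
sense strand without its poly-A tail, the tag includes its leading CATG, the
3'-most occurrence is used (SAGE tags are conventionally taken from the
3'-most anchoring enzyme site), and primer and adaptor sequences that would
add to the real product length are ignored. The function name is
hypothetical.

```python
def three_prime_product(cdna: str, tag: str) -> str:
    """Return the cDNA segment running from the SAGE tag to the 3' end.

    Assumptions: `cdna` is the sense strand without poly-A tail, `tag`
    includes the CATG anchoring site, and the 3'-most match is the one
    that gave rise to the tag.
    """
    cdna, tag = cdna.upper(), tag.upper()
    start = cdna.rfind(tag)
    if start == -1:
        raise ValueError("tag not found in cDNA")
    return cdna[start:]


# Made-up example sequence; the tag begins 3 bases into a 27-base cDNA.
example_cdna = "GGGCATGAAACCCTTTGGGACGTACGT"
example_tag = "CATGAAACCCTTTGG"
product = three_prime_product(example_cdna, example_tag)
```

In this made-up example the product spans 24 of the 27 bases, showing that a
tag lying further from the 3' end gives a correspondingly longer product.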

RNA from a source believed to express the cDNA corresponding to a
given tag is first converted to double-stranded cDNA using any standard
cDNA protocol. Similar conditions used to generate cDNA for SAGE library
construction can be employed except that a modified oligo-dT primer is used
to drive the first strand synthesis. For example, the oligonucleotide of
composition 5'-B-TGG GGG GCG CCG TTT T GG GAG TGA GGA(30)-3'
contains a poly-T stretch at the 3' end for hybridization and priming from
poly-A tails, an M13 priming site for use in subsequent PCR steps, a 5' biotin
label (B) for capture to streptavidin-coated magnetic beads, and an AscI
restriction endonuclease site for releasing the cDNA from the
streptavidin-coated magnetic beads. Theoretically, any sufficiently-sized DNA
region capable of hybridizing to a PCR primer can be used as well as any other
8 base pair recognizing endonuclease.

cDNA constructed utilizing this or a similar modified oligo-dT primer is
then processed exactly as described in U.S. Patent No. (insert) up until adapter
ligation, where only one adapter is ligated to the cDNA pool. After adapter
ligation, the cDNA is released from the streptavidin-coated magnetic beads and
is then used as a template for cDNA amplification.

Various PCR protocols can be employed using PCR priming sites
within the 3' modified oligo-dT primer and the SAGE tag. The SAGE
tag-derived PCR primer employed can be of varying length dictated by 5'
extension of the tag into the adaptor sequence. cDNA products are now
available for a variety of applications.

This technique can be further modified by: (1) altering the length and/or
content of the modified oligo-dT primer; (2) ligating adaptors other than that
previously employed within the SAGE protocol; (3) performing PCR from
template retained on the streptavidin-coated magnetic beads; and (4) priming
first strand cDNA synthesis with non-oligo-dT based primers.

Isolation of cDNA using GeneTrapper or modified GeneTrapper Technology

The reagents and manufacturer's instructions for this technology are
commercially available from Life Technologies, Inc., Gaithersburg, Maryland.
Briefly, a complex population of single-stranded phagemid DNA containing
directional cDNA inserts is enriched for the target sequence by hybridization
in solution to a biotinylated oligonucleotide probe complementary to the target
sequence. The hybrids are captured on streptavidin-coated paramagnetic
beads. A magnet retrieves the paramagnetic beads from the solution, leaving
nonhybridized single-stranded DNAs behind. Subsequently, the captured
single-stranded DNA target is released from the biotinylated oligonucleotide.
After release, the cDNA clone is further enriched by using a nonbiotinylated
target oligonucleotide to specifically prime conversion of the single-stranded
target to double-stranded DNA. Following transformation and plating,
typically 20% to 100% of the colonies represent the cDNA clone of interest.
To identify the desired cDNA clone, the colonies may be screened by colony
hybridization using the 32P-labeled oligonucleotide as described above for
solution hybridization, or alternatively by DNA sequencing and alignment of
all sequences obtained from numerous clones to determine a consensus
sequence.
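
As a hedged sketch of the final step, a consensus can be derived from
already-aligned clone sequences by taking the most frequent base at each
position. The Python below assumes the reads have been aligned to equal
length beforehand (the alignment itself is not shown), and the function name
consensus is hypothetical.

```python
from collections import Counter


def consensus(aligned_reads: list) -> str:
    """Majority-vote consensus of pre-aligned, equal-length sequences.

    Assumption: ties are broken in favour of the base seen first in the
    column, which is simply how Counter.most_common orders equal counts.
    """
    if not aligned_reads:
        raise ValueError("at least one read is required")
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to the same length")
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*aligned_reads)
    )
```

With three clones reading ACGT, ACGA and ACGT, the sketch returns ACGT, the
single discordant base being outvoted.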

The genes which are identified herein as being differentially expressed
in normal and cancer cells can be used diagnostically and prognostically.
Transcription levels in a test sample suspected of being neoplastic can be
determined and compared to the levels in normal colon cells. The test sample
may be from any tissue suspected of neoplasia, and particularly from either
suspected colorectal or suspected pancreatic cancer cells. The control cells for
the purposes of comparison are normal cells, preferably of the same tissue type
as the test sample, e.g., colon cells or pancreatic duct epithelial cells.
Upregulation of transcription or downregulation of transcription is therefore
diagnostic of the neoplastic state, depending on what gene is used as a test
reagent. Similarly, transcription levels can be monitored to assess patient
responses to anti-tumor therapies. Transcription levels will also provide
prognostic information. For example, the level of transcription in a test sample
can be compared to levels found in bona fide normal and tumor cells. More
extreme deviations from normal expression levels indicate a poorer prognosis.
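
The comparison described above can be sketched numerically. The Python below
assumes that transcript levels are available as tag counts together with the
total number of tags sampled from each library; the pseudocount of 0.5 and
the two-fold threshold (a log2 ratio of 1.0) are illustrative assumptions and
are not values taken from this disclosure. The function names are
hypothetical.

```python
import math


def log2_ratio(test_count: float, normal_count: float,
               test_total: float, normal_total: float,
               pseudocount: float = 0.5) -> float:
    """Log2 of the test/normal ratio of library-normalized transcript levels.

    The pseudocount (an assumption) avoids division by zero when a
    transcript is not observed in one of the samples.
    """
    if test_total <= 0 or normal_total <= 0:
        raise ValueError("library totals must be positive")
    test_freq = (test_count + pseudocount) / test_total
    normal_freq = (normal_count + pseudocount) / normal_total
    return math.log2(test_freq / normal_freq)


def classify(log_ratio: float, threshold: float = 1.0) -> str:
    """Label a transcript relative to the normal control sample."""
    if log_ratio >= threshold:
        return "upregulated"
    if log_ratio <= -threshold:
        return "downregulated"
    return "unchanged"
```

Whether an upregulated or a downregulated call points toward neoplasia
depends, as stated above, on which gene is used as the test reagent, and the
magnitude of the log ratio gives one possible measure of how extreme the
deviation from normal is.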

Transcription levels can be determined according to any means known
in the art. These include, without limitation, Northern blots, nuclear run-on
assays, in vitro transcription assays, primer extension assays, quantitative
reverse transcriptase-polymerase chain reactions (RT-PCR), and hybrid filter
binding assays. These techniques are well known in the art. See J.C. Alwine,
D.J. Kemp, G.R. Stark, Proc. Natl. Acad. Sci. U.S.A. 74, 5350 (1977); K.
Zinn, D. DiMaio, T. Maniatis, Cell 34, 865 (1983); G. Veres, R.A. Gibbs,
S.E. Scherer, C.T. Caskey, Science 237, 415 (1987).

Similarly, upregulated genes and downregulated genes can be detected 
by measuring expression of their protein products. This can be done by any 
means known in the art, including but not limited to Western (immuno) blot,
enzyme-linked immunosorbent assay, radioimmunoassay, and enzyme assay.
Such techniques are well known in the art. Protein products can be detected 
in tissue samples of a test patient, using a suspect sample as a test sample, and 
a matched normal tissue sample from the same tissue type as a control. If 
normal tissue is not available then a closely related tissue type can be used. 
Desirably both the samples being compared will be from the same individual. 
Alternatively, aberrant expression levels of protein products can be detected in 
body samples, such as blood, serum, feces, urine, sputum. As a control, a 
normal matched sample can be used from a healthy individual. Aberrant 
expression levels of transcripts can also be detected in such body samples, 
particularly in blood and serum. 

Probes for use in the assays for transcription levels of particular genes
or sets of genes may be RNA or DNA. The probes will be isolated
substantially free of other cellular RNAs or DNAs. If the reagent contains one
probe then it will comprise at least 50% of the nucleic acids in the reagent
composition. If the reagent contains more than one probe, then the proportion
will decrease accordingly, so that specific probes will still comprise at least
50% of the nucleic acids in the reagent composition.

Probes can be labeled according to any means known in the art. These
may include radioactive labels, fluorescent labels, enzymatic labels, and binding
partner labels such as biotin. Means for labeling and detecting probes are well
known in the art. Probes comprise at least 10, 11, 12, 15, 20, or 30 contiguous
nucleotides of a selected gene.
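
As a rough illustration of the length criterion above, the following Python sketch tests whether a candidate probe contains a run of at least a given number of contiguous nucleotides taken from a gene sequence. The function name, the example sequences, and the choice to compare one strand only (no reverse complement) are assumptions made for illustration and are not part of the disclosure.

```python
def shares_contiguous_run(probe, gene, min_len=10):
    """Return True if `probe` contains at least `min_len` contiguous
    nucleotides that also occur contiguously in `gene` (same strand only)."""
    if min_len <= 0:
        raise ValueError("min_len must be positive")
    probe, gene = probe.upper(), gene.upper()
    if len(probe) < min_len or len(gene) < min_len:
        return False
    # Collect every window of length min_len found in the gene.
    gene_windows = {gene[i:i + min_len] for i in range(len(gene) - min_len + 1)}
    # The probe qualifies if any of its windows is one of the gene's windows.
    return any(probe[i:i + min_len] in gene_windows
               for i in range(len(probe) - min_len + 1))


# Hypothetical sequences, for illustration only.
gene = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"
probe = "TTTT" + gene[5:20] + "AAAA"
print(shares_contiguous_run(probe, gene, min_len=15))
```

A longer shared run implies every shorter one, so a probe that passes at 15 nucleotides also passes at 10, 11, and 12.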

This invention provides proteins or polypeptides expressed from the
polynucleotides of this invention, which is intended to include wild-type and
recombinantly produced polypeptides and proteins from procaryotic and
eucaryotic host cells, as well as muteins, analogs and fragments thereof. In
some embodiments, the term also includes antibodies and anti-idiotypic
antibodies.

It is understood that functional equivalents or variants of the wild-type
polypeptide or protein also are within the scope of this invention, for example,
those having conservative amino acid substitutions. Other analogs include
fusion proteins comprising a protein or polypeptide.

The proteins and polypeptides of this invention are obtainable by a
number of processes well known to those of skill in the art, which include
purification, chemical synthesis and recombinant methods. Full length proteins
can be purified from a colon or pancreatic cell or tissue lysate by methods such
as immunoprecipitation with antibody, and standard techniques such as gel
filtration, ion-exchange, reversed-phase, and affinity chromatography using a
fusion protein as shown herein. For such methodology, see for example
Deutscher et al. (1999) Guide To Protein Purification: Methods In
Enzymology (Vol. 182, Academic Press). Accordingly, this invention also
provides the processes for obtaining these proteins and polypeptides as well as
the products obtainable and obtained by these processes.

The proteins and polypeptides also can be obtained by chemical 
synthesis using a commercially available automated peptide synthesizer such 
as those manufactured by Perkin Elmer/Applied Biosystems, Inc., Model 430A
or 431A, Foster City. The synthesized protein or polypeptide can be
precipitated and further purified, for example by high performance liquid
chromatography (HPLC). Accordingly, this invention also provides a process
for chemically synthesizing the proteins of this invention by providing the
sequence of the protein and reagents, such as amino acids and enzymes, and
linking together the amino acids in the proper orientation and linear sequence.

Alternatively, the proteins and polypeptides can be obtained by 
well-known recombinant methods as described, for example, in Sambrook et 
al., (1989), supra, using the host cell and vector systems described above. 

Also provided by this application are the polypeptides and proteins 
described herein conjugated to a detectable agent for use in the diagnostic 
methods. For example, detectably labeled proteins and polypeptides can be 
bound to a column and used for the detection and purification of antibodies. 
They also are useful as immunogens for the production of antibodies as 
described below. The proteins and fragments of this invention are useful in an
in vitro assay system to screen for agents or drugs that modulate cellular
processes.

The proteins of this invention also can be combined with various liquid 
phase carriers, such as sterile or aqueous solutions, pharmaceutically 
acceptable carriers, suspensions and emulsions. Examples of non-aqueous 
solvents include propyl ethylene glycol, polyethylene glycol and vegetable oils. 
When used to prepare antibodies, the carriers also can include an adjuvant that 
is useful to non-specifically augment a specific immune response. A skilled 
artisan can easily determine whether an adjuvant is required and select one. 
However, for the purpose of illustration only, suitable adjuvants include, but
are not limited to, Freund's Complete and Incomplete, mineral salts and
polynucleotides.

This invention also provides a pharmaceutical composition comprising
any of a protein, analog, mutein, polypeptide fragment, antibody, antibody
fragment or anti-idiotypic antibody of this invention, alone or in combination
with each other or other agents, and an acceptable carrier. These compositions
are useful for various diagnostic and therapeutic methods.

Antibodies can be generated using the proteins encoded by the
transcripts identified by the tags disclosed herein. Use of all or portions of the
protein as immunogens is routine in the art. Similarly, fusion proteins can be
used as immunogens. Antibodies can be affinity purified using the proteins or
portions thereof used as immunogens. Similarly, monoclonal antibodies
specifically immunoreactive with the protein sequences of the invention can be
generated according to techniques which are well known in the art.

Antibodies can be used analytically to quantitate the expression of
particular transcripts identified herein as upregulated or downregulated in
cancer. In addition, antibodies can be conjugated or non-covalently linked to
cytotoxic agents, such as cytotoxins, radionuclides, chemotherapeutic drugs,
etc. Such antibodies can be used therapeutically to specifically target cancer
cells in which the protein antigens are upregulated. These include the proteins
encoded by the transcripts identified by the tags shown in Tables 2, 4, and 5.
Means of making such linked cytotoxic antibodies and of administering the
same are well known in the art.

Also provided by this invention is an antibody capable of specifically
forming a complex with the proteins or polypeptides as described above. The
term "antibody" includes polyclonal antibodies and monoclonal antibodies.
The antibodies include, but are not limited to, mouse, rat, and rabbit or human
antibodies.

Laboratory methods for producing polyclonal antibodies and
monoclonal antibodies, as well as deducing their corresponding nucleic acid
sequences, are known in the art, see Harlow and Lane (1988) supra and
Sambrook et al. (1989) supra. The monoclonal antibodies of this invention can
be biologically produced by introducing protein or a fragment thereof into an
animal, e.g., a mouse or a rabbit. The antibody producing cells in the animal
are isolated and fused with myeloma cells or heteromyeloma cells to produce
hybrid cells or hybridomas. Accordingly, the hybridoma cells producing the
monoclonal antibodies of this invention also are provided.

Thus, using the protein or fragment thereof, and well known methods,
one of skill in the art can produce and screen the hybridoma cells and
antibodies of this invention for antibodies having the ability to bind the proteins
or polypeptides.

If a monoclonal antibody being tested binds with the protein or
polypeptide, then the antibody being tested and the antibodies provided by the
hybridomas of this invention are equivalent. It also is possible to determine,
without undue experimentation, whether an antibody has the same specificity
as the monoclonal antibody of this invention by determining whether the
antibody being tested prevents a monoclonal antibody of this invention from
binding the protein or polypeptide with which the monoclonal antibody is
normally reactive. If the antibody being tested competes with the monoclonal
antibody of the invention as shown by a decrease in binding by the monoclonal
antibody of this invention, then it is likely that the two antibodies bind to the
same or a closely related epitope. Alternatively, one can pre-incubate the
monoclonal antibody of this invention with a protein with which it is normally
reactive, and determine if the monoclonal antibody being tested is inhibited in
its ability to bind the antigen. If the monoclonal antibody being tested is
inhibited then, in all likelihood, it has the same, or a closely related, epitopic
specificity as the monoclonal antibody of this invention.

The term "antibody" also is intended to include antibodies of all
isotypes. Particular isotypes of a monoclonal antibody can be prepared either
directly by selecting from the initial fusion, or prepared secondarily, from a
parental hybridoma secreting a monoclonal antibody of different isotype by
using the sib selection technique to isolate class switch variants using the
procedure described in Steplewski et al. (1985) Proc. Natl. Acad. Sci. 82:8653
or Spira et al. (1984) J. Immunol. Methods 74:307.

This invention also provides biologically active fragments of the
polyclonal and monoclonal antibodies described above. These "antibody
fragments" retain some ability to selectively bind with their antigen or
immunogen. Such antibody fragments can include, but are not limited to:

(1) Fab,

(2) Fab',

(3) F(ab')2,

(4) Fv, and

(5) SCA.

A specific example of "a biologically active antibody fragment" is a
CDR region of the antibody. Methods of making these fragments are known
in the art, see for example, Harlow and Lane, (1988) supra.

The antibodies of this invention also can be modified to create chimeric
antibodies and humanized antibodies (Oi, et al. (1986) BioTechniques
4(3):214). Chimeric antibodies are those in which the various domains of the
antibodies' heavy and light chains are coded for by DNA from more than one
species.

The isolation of other hybridomas secreting monoclonal antibodies with
the specificity of the monoclonal antibodies of the invention can also be
accomplished by one of ordinary skill in the art by producing anti-idiotypic
antibodies (Herlyn, et al. (1986) Science 232:100). An anti-idiotypic antibody
is an antibody which recognizes unique determinants present on the
monoclonal antibody produced by the hybridoma of interest.

Idiotypic identity between monoclonal antibodies of two hybridomas
demonstrates that the two monoclonal antibodies are the same with respect to
their recognition of the same epitopic determinant. Thus, by using antibodies
to the epitopic determinants on a monoclonal antibody it is possible to identify
other hybridomas expressing monoclonal antibodies of the same epitopic
specificity.

It is also possible to use the anti-idiotype technology to produce
monoclonal antibodies which mimic an epitope. For example, an anti-idiotypic
monoclonal antibody made to a first monoclonal antibody will have a binding
domain in the hypervariable region which is the mirror image of the epitope
bound by the first monoclonal antibody. Thus, in this instance, the
anti-idiotypic monoclonal antibody could be used for immunization for
production of these antibodies.

As used in this invention, the term "epitope" is meant to include any
determinant having specific affinity for the monoclonal antibodies of the
invention. Epitopic determinants usually consist of chemically active surface
groupings of molecules such as amino acids or sugar side chains and usually
have specific three dimensional structural characteristics, as well as specific
charge characteristics.

The antibodies of this invention can be linked to a detectable agent or
label. There are many different labels and methods of labeling known to those
of ordinary skill in the art.

The antibody-label complex is useful to detect the protein or fragments
in a sample, using standard immunochemical techniques such as
immunohistochemistry as described by Harlow and Lane (1988) supra.
Competitive and non-competitive immunoassays in either a direct or indirect
format are examples of such assays, e.g., enzyme linked immunoassay (ELISA),
radioimmunoassay (RIA) and the sandwich (immunometric) assay. Those of
skill in the art will know, or can readily discern, other immunoassay formats
without undue experimentation.

The coupling of antibodies to low molecular weight haptens can
increase the sensitivity of the assay. The haptens can then be specifically
detected by means of a second reaction. For example, it is common to use
haptens such as biotin, which reacts with avidin, or dinitrophenyl, pyridoxal, and
fluorescein, which can react with specific anti-hapten antibodies. See Harlow
and Lane (1988) supra.

The monoclonal antibodies of the invention also can be bound to many
different carriers. Thus, this invention also provides compositions containing
the antibodies and another substance, active or inert. Examples of well-known
carriers include glass, polystyrene, polypropylene, polyethylene, dextran, nylon,
amylases, natural and modified celluloses, polyacrylamides, agaroses and
magnetite. The nature of the carrier can be either soluble or insoluble for
purposes of the invention. Those skilled in the art will know of other suitable
carriers for binding monoclonal antibodies, or will be able to ascertain such,
using routine experimentation.

Compositions containing the antibodies, fragments thereof or cell lines
which produce the antibodies, are encompassed by this invention. When these
compositions are to be used pharmaceutically, they are combined with a
pharmaceutically acceptable carrier.

The present invention also provides a screen for various agents which
modulate the expression of a gene in a pancreatic or colon cell. To practice the
method in vitro, suitable cell cultures or tissue cultures are first provided. The
cell can be a cultured cell or a genetically modified cell in which a transcript
from SEQ ID NOS: 1-732, or their complements, is expressed. Alternatively,
the cells can be from a tissue biopsy. The cells are cultured under conditions
(temperature, growth or culture medium and gas (CO2)) and for an appropriate
amount of time to attain exponential proliferation without density dependent
constraints. It also is desirable to maintain an additional separate cell culture,
one which does not receive the agent being tested, as a control.

As is apparent to one of skill in the art, suitable cells may be cultured
in microtiter plates and several agents may be assayed at the same time by
noting genotypic changes, phenotypic changes or cell death.

When the agent is a composition other than a DNA or RNA, the agent
may be directly added to the cell culture or added to culture medium for
addition. As is apparent to those skilled in the art, an "effective" amount must
be added which can be empirically determined. When the agent is a
polynucleotide, it may be directly added by use of a gene gun or
electroporation. Alternatively, it may be inserted into the cell using a gene
delivery vehicle or vector as described above.

An agent is a potential therapeutic if it alters the expression of a gene in
the cell. Altered expression can be detected by assaying for altered mRNA 
expression or protein expression using the probes, primers and antibodies as 
described herein. 

For the purposes of this invention, an "agent" is intended to include, but 
not be limited to a biological or chemical compound such as a simple or 
complex organic or inorganic molecule, a peptide, a protein (e.g. antibody) or 
a polynucleotide (e.g. anti-sense). A vast array of compounds can be 
synthesized, for example polymers, such as polypeptides and polynucleotides, 
and synthetic organic compounds based on various core structures, and these 
are also included in the term "agent". In addition, various natural sources can 
provide compounds for screening, such as plant or animal extracts, and the 
like. It should be understood, although not always explicitly stated, that the
agent is used alone or in combination with another agent, having the same or
different biological activity as the agents identified by the inventive screen. The
agents and methods also are intended to be combined with other therapies.

The above disclosure generally describes the present invention. A more 
complete understanding can be obtained by reference to the following specific 
examples which are provided herein for purposes of illustration only, and are 
not intended to limit the scope of the invention. 



EXAMPLE 1

This example demonstrates the characterization of the general
transcription of human colorectal epithelium, colorectal cancers, and pancreatic
cancers.

We used the recently developed SAGE (serial analysis of gene
expression) method to identify and quantify a total of 303,706 transcripts
derived from human colorectal (CR) epithelium, CR cancers or pancreatic
cancers (Table 1A) (3). These transcripts represented approximately 48,741
different genes (4) that ranged in average expression from 1 copy per cell to as
many as 5,300 copies per cell (5). The number of different transcripts observed
in each cell population varied from 14,247 to 20,471. The bulk of the mRNA
mass (75%) consisted of transcripts expressed at more than five copies per cell
on average (Table 1B). In contrast, the majority (86%) of transcripts were
expressed at less than 5 copies per cell, but in aggregate this low abundance
class represented only 25% of the mRNA mass. This distribution was
consistently observed among the different samples analyzed and was consistent
with previous studies of RNA abundance classes based on RNA-DNA
reassociation kinetics (Rot curves) (6). Monte Carlo simulations revealed that our
analyses had a 92% probability of detecting a transcript expressed at an
average of three copies per cell (7).
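
The kind of calculation behind such a detection probability can be sketched in Python as follows. This is a minimal illustration and not the simulation actually used: it treats every sequenced tag as an independent draw from the mRNA pool, and it assumes a hypothetical pool of 300,000 mRNA molecules per cell, a figure that is not stated in this section. Under these simplifications the result need not reproduce the 92% value reported above.

```python
import math
import random


def detection_probability(copies_per_cell, mrnas_per_cell, tags_sampled):
    """Closed-form chance of seeing at least one tag for a transcript,
    treating each sampled tag as an independent draw."""
    p = copies_per_cell / mrnas_per_cell  # chance that one tag hits the transcript
    return 1.0 - (1.0 - p) ** tags_sampled


def simulate_detection(copies_per_cell, mrnas_per_cell, tags_sampled,
                       trials=20000, seed=0):
    """Monte Carlo estimate of the same probability."""
    p = copies_per_cell / mrnas_per_cell
    if not 0.0 < p < 1.0:
        raise ValueError("expected 0 < copies_per_cell / mrnas_per_cell < 1")
    rng = random.Random(seed)
    detected = 0
    for _ in range(trials):
        # The position of the first matching tag is geometric; sample it by inversion.
        u = 1.0 - rng.random()  # uniform on (0, 1]
        first_hit = math.floor(math.log(u) / math.log(1.0 - p)) + 1
        if first_hit <= tags_sampled:
            detected += 1
    return detected / trials


# Assumed inputs: 3 copies per cell, 300,000 mRNAs per cell (hypothetical),
# and 300,000 tags analyzed.
exact = detection_probability(3, 300_000, 300_000)
estimate = simulate_detection(3, 300_000, 300_000)
print(f"closed form: {exact:.3f}, simulated: {estimate:.3f}")
```

The closed form and the simulation should agree to within sampling error; changing the assumed mRNA content per cell or the number of tags attributed to a single cell population shifts the probability accordingly.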

Many of the SAGE tags appeared to represent previously undescribed
transcripts, as only 54% of the tags matched entries in GenBank (Table 1).
Twenty percent of these matching transcripts corresponded to characterized
mRNA sequence entries in GenBank, whereas 80% matched uncharacterized
EST entries. As expected, the likelihood of a tag being present in the
databases was related to abundance; GenBank matches were identified for 98%
of the transcripts expressed at more than 500 copies per cell but for only 51%
of the transcripts expressed at < 5 copies per cell. Because the SAGE data
provide a quantitative assay of transcript abundance, unaffected by differences
in cloning or PCR efficiency, these data provide an independent and relatively
unbiased estimate of the current completeness of publicly available EST
databases.
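
Matching a tag to a database entry requires deriving the expected tag from each entry. The Python sketch below takes a sense-strand cDNA sequence, locates its 3'-most NlaIII site (recognition sequence CATG), and returns the bases immediately downstream, then indexes a set of entries by tag. The 10-base tag length, the function names, and the toy sequences are assumptions for illustration; this is not the matching software used for the analysis.

```python
def extract_sage_tag(cdna, anchor_site="CATG", tag_length=10):
    """Return the tag adjacent to the 3'-most anchoring-enzyme site,
    or None if the site is absent or too close to the 3' end."""
    seq = cdna.upper()
    site_start = seq.rfind(anchor_site)  # 3'-most occurrence of the site
    if site_start == -1:
        return None
    tag_start = site_start + len(anchor_site)
    tag = seq[tag_start:tag_start + tag_length]
    return tag if len(tag) == tag_length else None


def build_tag_index(entries):
    """Map each derivable tag to the names of the entries that yield it."""
    index = {}
    for name, sequence in entries.items():
        tag = extract_sage_tag(sequence)
        if tag is not None:
            index.setdefault(tag, []).append(name)
    return index


# Hypothetical database entries, for illustration only.
entries = {
    "entry_a": "GGCATGAAACCCGGGTTTACGCATGTTTGGGCCCAAATTT",
    "entry_b": "AAAA",  # no NlaIII site, so no tag can be derived
}
index = build_tag_index(entries)
print(index.get("TTTGGGCCCA", []))
```

An observed tag that is absent from such an index would count as unmatched, which is how the 54% matching figure above can be read.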

EXAMPLE 2

This example demonstrates a comparison of the expression pattern of
normal colon epithelium and primary colon cancers.

Comparison of expression patterns between normal colon epithelium
and primary colon cancers revealed that the majority of transcripts were
expressed at similar levels (Fig. 1A). However, the expression profiles also
revealed 289 transcripts that were expressed at significantly different levels [P
< 0.01, (8)]. Of these 289, 181 were decreased in colon tumors compared to
normal colon (average decrease 10-fold; Fig. 1B; examples in Fig. 2A).
Conversely, 108 transcripts were expressed at higher levels in the colon
cancers than in normal colon (average increase 13-fold; Fig. 1C; examples in
Fig. 2A). Monte Carlo simulations indicated that the analysis would have
detected over 95% of those transcripts expressed at a 6-fold or greater level
in normal vs. tumor cells or vice versa (9). Because relatively stringent criteria
were used for defining differences [P < 0.01, (8)], the number of differences
reported above is likely to be an underestimate.
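
One way to attach a P value to a difference in tag counts between two SAGE libraries is a Monte Carlo test, sketched here in Python. This is an illustrative stand-in and not the procedure of note 8: it assumes that, if a transcript were equally abundant in both samples, each of its observed tags would fall into either library in proportion to library size. The function name, the tag counts, and the library sizes of 60,000 tags are hypothetical.

```python
import random


def monte_carlo_p_value(count_a, total_a, count_b, total_b,
                        trials=10000, seed=0):
    """Estimate how often chance alone gives a difference in tag frequency
    at least as large as the one observed between libraries A and B."""
    rng = random.Random(seed)
    pooled = count_a + count_b                 # all tags seen for this transcript
    share_a = total_a / (total_a + total_b)    # chance a tag lands in library A
    observed = abs(count_a / total_a - count_b / total_b)
    as_extreme = 0
    for _ in range(trials):
        # Reassign every pooled tag to a library at random under the null.
        sim_a = sum(1 for _ in range(pooled) if rng.random() < share_a)
        sim_b = pooled - sim_a
        if abs(sim_a / total_a - sim_b / total_b) >= observed:
            as_extreme += 1
    return as_extreme / trials


# Hypothetical counts: a six-fold difference and a near-identical pair.
p_six_fold = monte_carlo_p_value(6, 60_000, 36, 60_000)
p_similar = monte_carlo_p_value(20, 60_000, 22, 60_000)
print(p_six_fold < 0.01, p_similar < 0.01)
```

With a cutoff of P < 0.01, the six-fold pair would be called different and the near-identical pair would not. Because many transcripts are compared at once, a stringent cutoff of this kind reduces false calls at the cost of missing smaller differences.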

EXAMPLE 3

This example demonstrates the similarities and differences between
cancer cell line transcription and transcription of primary cancer tissues.

To determine how many of the 289 differences were independent of the cellular
microenvironment of cancers in vivo, SAGE data from CR cancer cell lines were
compared to those from primary CR cancer tissues (Fig. 1B, 1C). Perhaps
surprisingly, the majority of transcripts (130 of 181) that were expressed at
reduced levels in cancer cells in vivo were also expressed at significantly lower
levels in the cell lines (Fig. 1B). Likewise, a significant fraction of the
transcripts expressed at increased levels in primary cancers were also expressed
at higher levels in the CR cancer cell lines (Fig. 1C). Thus, many of the gene
expression differences that distinguish normal from tumor cells in vivo persist
during in vitro growth. However, despite these similarities there were also
many differences. For example, only 47 of 228 genes expressed at higher
levels in CR cancer cell lines were also expressed at high levels in the primary
CR cancers.

In combination, comparing the expression pattern of CR cancer cells
(in vivo or in vitro) to normal colon revealed 548 differentially expressed
transcripts (Fig. 1B, 1C, Tables 2 and 3). The average difference in expression
for these transcripts was 15-fold. Although the ability to detect differences is
influenced by the magnitude of the variance, with the power to detect smaller
differences being less, 92 transcripts that were less than three-fold different
were identified among the 548 transcripts. However, those genes exhibiting
the greatest differences in expression are likely to be the most biologically
important.

EXAMPLE 4

This example demonstrates the similarities and differences between
colorectal cancer transcription and pancreatic cancer transcription.

To determine whether the changes noted in CR cancers were neoplasia or cell
type specific, we performed SAGE on mRNA derived from pancreatic cancers.
A total of 404 transcripts were expressed at higher levels in pancreatic cancers
compared to normal colon epithelium (examples in Fig. 2B). The majority
(268) of these transcripts were pancreas-specific (10) (example in Fig. 2C),
although 136 were also expressed at high levels in CR cancers. These 136
transcripts constituted 47% of the 289 transcripts increased in CR cancers
relative to normal colon and are likely to be related to the neoplastic process
rather than to the specific cell type of origin.
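
The counts quoted in Examples 2 to 4 can be cross-checked with a few lines of arithmetic. The Python sketch below adopts one reading that is consistent with the reported figures, namely that the 289 transcripts increased in CR cancers are the union of those increased in primary cancers (108) and those increased in cell lines (228), with 47 common to both. That reading is an assumption made for this check, not a statement in the text, and the variable names are illustrative.

```python
# Figures taken from Examples 2 to 4.
up_in_primary = 108        # increased in primary colon cancers
up_in_cell_lines = 228     # increased in CR cancer cell lines
up_in_both = 47            # increased in cell lines and in primary cancers

# Inclusion-exclusion: increased in vivo or in vitro (assumed reading).
up_in_cr_any = up_in_primary + up_in_cell_lines - up_in_both

pancreatic_up = 404        # increased in pancreatic cancers
pancreas_specific = 268    # increased in pancreatic cancers only
shared_with_cr = pancreatic_up - pancreas_specific

percent_shared = round(100 * shared_with_cr / up_in_cr_any)
print(up_in_cr_any, shared_with_cr, percent_shared)
```

Under this reading the union comes to 289 and the shared set to 136, which matches the 47% figure stated above.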

EXAMPLE 5

This example demonstrates the reproducibility of the transcription
patterns observed among a larger number of cancer samples.

One question that arose from these data is the potential heterogeneity
of expression between individual tumors. The SAGE data were acquired from
two examples of each tissue type (normal colon, primary CR cancer, CR cancer
cell line, etc.). To examine the generality of these expression profiles, we
arbitrarily selected 27 differentially expressed transcripts and evaluated them
in six to twelve samples of normal colon and primary cancers by Northern blot
analysis (11). In general, expression patterns were very reproducible among
different samples. Of 10 genes with elevated expression in normal colon
relative to CR cancers as determined by SAGE, each was detected in the
normal colon samples and was expressed at considerably lower levels in tumors
(examples in Fig. 2A). Similarly, most of the genes identified by SAGE as
increased in CR or pancreatic cancers were confirmed to be reproducibly
expressed in the majority of primary cancers examined by Northern blot
(examples in Fig. 2A). It is important to note, however, that there were
differences among the cancers, with a few cancers exhibiting particularly high
or low levels of individual transcripts. Such differences in gene expression
undoubtedly contribute to the observed heterogeneity in biological properties
of cancers derived from the same organ.

EXAMPLE 6

This example demonstrates the identities of some of the transcripts
which were found to be differentially expressed in tumor and normal tissues.

What are the identities of the differentially expressed genes? Of the 548
differentially expressed transcripts, 337 were tentatively identified through
database comparisons. When tested, the great majority (93%) of these
identifications proved to be legitimate (13), as expected from previous SAGE
analyses. Although a large number of differentially expressed genes were
identified, some simple patterns did emerge. For example, genes that were
expressed at higher levels in normal colon epithelium than in CR tumors were
often differentiation-related. These genes included liver fatty acid binding
protein, cytokeratin 20, carbonic anhydrase, guanylin and uroguanylin,
which are known to be important for the normal physiology or architecture of
the colon epithelium (Table 2). On the other hand, genes that were increased
in CR cancers were often related to the robust growth characteristics that these
cells exhibit. For example, gene products associated with protein synthesis,
including 48 ribosomal proteins, five elongation factors, and five genes
involved in glycolysis were observed to be elevated in both CR and pancreatic
cancers compared to normal colon cells. Although the majority of the
transcripts could not have been predicted to be differentially expressed in
cancers, several have previously been shown to be dysregulated in neoplastic
cells. The latter included IGFII, B23 nucleophosmin, the Pi form of
glutathione S-transferase, and several ribosomal proteins which were all
increased in cancer cells as previously reported. Likewise, Dra and gelsolin
were both decreased in cancer as previously reported. Surprisingly, two widely
studied oncogenes, c-fos and c-erbB3, were expressed at much higher levels in
normal colon epithelium than CR cancers, in contrast to their up-regulation in
transformed cells.

In summary, these data provide basic information necessary for
understanding the gene expression differences that underlie cancer phenotypes.
They additionally provide a necessary framework for interpreting the
significance of individual differentially expressed genes. Although this study
demonstrated that a large number of such differences exist (approximately 500
at the depth of analysis employed), it was equally remarkable that the fraction
of transcripts exhibiting significant differences was relatively small,
representing 1.5% of the transcripts detected in any given cell type (26). The
fact that many, but not all, of the differences were preserved during in vitro
culture demonstrates the utility of cultured lines for examination of some
aspects of gene expression, but also provides a note of caution in relying on
such lines to perfectly mimic tumors in their natural environment. Finally, the
finding that hundreds of specific genes are expressed at different levels in CR
cancers, and that some of these are also expressed differentially in pancreatic
cancers, provides a wealth of new reagents for future biologic and diagnostic
experimentation.

6. J. O. Bishop, J. G. Morton, M. Rosbash, M. Richardson, Nature
250, 199 (1974); B. Lewin, Gene Expression Vol 2 (John Wiley and Sons,
New York 1980).

7. Computer simulations indicated that analysis of 300,000 tags
would yield a 92% chance of detecting a tag for a transcript whose expression

large number of comparisons being made, Monte Carlo analysis was used for

significant p-chance value below the cutoff.

one sample and 0.0006 in a second sample revealed a significant difference (P
< 0.01, [8]) 95% of the time.

CLAIMS

1. A method of diagnosing colon cancer in a sample suspected of being
neoplastic, comprising the steps of:

comparing the level of at least one transcript in a first sample of
a tissue to a second sample, wherein the first sample is of a colonic tissue
suspected of being neoplastic and the second sample is of a normal human
colonic tissue, and wherein the transcript is identified by a tag selected from the
group consisting of those shown in Table 3;

identifying the first sample as neoplastic when the level of the
at least one transcript is found to be lower in the first sample than in the second
sample.

2. A method of diagnosing colon cancer in a sample suspected of being 
neoplastic, comprising the steps of: 

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of a colonic tissue 
suspected of being neoplastic and the second sample is of a normal human 
colonic tissue, and wherein the transcript is identified by a tag selected from the
group consisting of those shown in Table 2; 

identifying the first sample as neoplastic when the level of the 
at least one transcript is found to be higher in the first sample than in the 
second sample. 

3. The method of claim 1 wherein a comparison of at least two of said 
transcripts is performed. 

4. The method of claim 2 wherein a comparison of at least two of said 
transcripts is performed. 

5. The method of claim 1 wherein a comparison of at least five of said
transcripts is performed. 

6. The method of claim 2 wherein a comparison of at least five of said 
transcripts is performed. 

7. The method of claim 1 wherein a comparison of at least ten of said 
transcripts is performed. 

8. The method of claim 2 wherein a comparison of at least ten of said 
transcripts is performed. 

9. The method of claim 1 wherein a comparison of at least twenty of said 
transcripts is performed. 

10. The method of claim 2 wherein a comparison of at least twenty of said 
transcripts is performed. 

11. The method of claim 1 wherein a comparison of at least thirty of said 
transcripts is performed. 

12. The method of claim 2 wherein a comparison of at least thirty of said 
transcripts is performed. 

13. An isolated and purified human nucleic acid molecule which comprises 
a SAGE tag selected from SEQ ID NO: 1-732.

14. The nucleic acid molecule of claim 13 which is a cDNA molecule. 

15. The nucleic acid molecule of claim 13 wherein the SAGE tag is located
at the 3' end of the molecule, adjacent to the 3'-most NlaIII restriction enzyme
site.

16. An isolated nucleotide probe comprising at least 10 nucleotides of a
human nucleic acid molecule, wherein the human nucleic acid molecule
comprises a SAGE tag selected from SEQ ID NO: 1-732.

17. The probe of claim 16 which comprises the selected SAGE tag. 

18. A diagnostic reagent for evaluating neoplasia of a colorectal tissue, 
comprising at least 2 probes according to claim 16. 

19. The diagnostic reagent of claim 18 which comprises at least 5 probes
according to claim 16.

20. The diagnostic reagent of claim 18 which comprises at least 10 probes 
according to claim 16. 

21. The diagnostic reagent of claim 18 which comprises at least 20 probes
according to claim 16.

22. The diagnostic reagent of claim 18 which comprises at least 30 probes 
according to claim 16. 

23. A diagnostic reagent for evaluating neoplasia of a colorectal tissue,
comprising at least 2 probes according to claim 17.

24. A method of diagnosing pancreatic cancer in a sample suspected of
being neoplastic, comprising the steps of:

comparing the level of at least one transcript in a first sample of
a tissue to a second sample, wherein the first sample is of a pancreatic tissue
suspected of being neoplastic and the second sample is of a normal human
colon tissue, wherein said transcript is identified by a tag selected from the
group consisting of those shown in Table 4;

identifying the first sample as neoplastic when the level of the
at least one transcript is found to be higher in the first sample than in the
second sample.

25. A method of diagnosing cancer in a sample suspected of being 
neoplastic, comprising the steps of: 

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of a tissue suspected of 
being neoplastic and the second sample is of a normal human tissue of the 
same tissue type, wherein said transcript is identified by a tag selected from the 
group consisting of those shown in Table 5; 

identifying the first sample as neoplastic when the level of the 
at least one transcript is found to be higher in the first sample than in the 
second sample. 

26. A method to aid in the determination of a prognosis for a colon cancer 
patient, comprising the steps of: 

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of a neoplastic colonic 
tissue and the second sample is of a normal human colonic tissue, and wherein 
the transcript is identified by a tag selected from the group consisting of those 
shown in Table 3; 

determining a poorer prognosis if the level of the at least one 
transcript is found to be lower in the first sample than in the second sample. 

27. A method to aid in determining a prognosis for a patient with colon 
cancer, comprising the steps of: 

comparing the level of at least one transcript in a first tissue 
sample to a second sample, wherein the first sample is of a colonic cancer 
tissue and the second sample is of a normal human colonic tissue, and wherein 
the transcript is identified by a tag selected from the group consisting of those 
shown in Table 2; 

determining a poorer prognosis if the level of the at least one 
transcript is found to be higher in the first sample than in the second sample. 

28. A method of diagnosing colon cancer in a sample suspected of being 
neoplastic, comprising the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
colonic tissue suspected of being neoplastic and the second sample is of a 
normal human colonic tissue, and wherein the protein is encoded by a transcript 
identified by a tag selected from the group consisting of those shown in Table 
3; 

identifying the first sample as neoplastic when the level of 
expression of the protein is found to be lower in the first sample than in the 
second sample. 

29. A method of diagnosing colon cancer in a sample suspected of being 
neoplastic, comprising the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
colonic tissue suspected of being neoplastic and the second sample is of a 
normal human colonic tissue, and wherein the protein is encoded by a transcript 
identified by a tag selected from the group consisting of those shown in Table 
2; 

identifying the first sample as neoplastic when expression of the 
protein is found to be higher in the first sample than in the second sample. 

30. A method to aid in determining a prognosis of a patient having 
pancreatic cancer, comprising the steps of: 

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of a neoplastic 
pancreatic tissue and the second sample is of a normal human colon tissue, 
wherein said transcript is identified by a tag selected from the group consisting 
of those shown in Table 4; 

determining a poorer prognosis if the level of the at least one 
transcript is found to be higher in the first sample than in the second sample. 

31. A method to aid in providing a prognosis for a cancer patient, 
comprising the steps of: 

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of a neoplastic tissue 
and the second sample is of a normal human tissue of the same tissue type, 
wherein said transcript is identified by a tag selected from the group consisting 
of those shown in Table 5; 

determining a poorer prognosis if the level of the at least one 
transcript is found to be higher in the first sample than in the second sample. 

32. A method of diagnosing pancreatic cancer in a sample suspected of 
being neoplastic, comprising the steps of: 

comparing the level of expression of at least one protein 
encoded by a transcript in a first sample of a tissue to a second sample, wherein 
the first sample is of a pancreatic tissue suspected of being neoplastic and the 
second sample is of a normal human colon tissue, wherein said protein is 
encoded by a transcript identified by a tag selected from the group consisting 
of those shown in Table 4; 

identifying the first sample as neoplastic when expression of the 
protein is found to be higher in the first sample than in the second sample. 

33. A method of diagnosing cancer in a sample suspected of being 
neoplastic, comprising the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
tissue suspected of being neoplastic and the second sample is of a normal 
human tissue, wherein said protein is encoded by a transcript identified by a tag 
selected from the group consisting of those shown in Table 5; 

identifying the first sample as neoplastic when expression of the 
protein is found to be higher in the first sample than in the second sample. 



34. A method to aid in the determination of a prognosis for a colon cancer 
patient, comprising the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
neoplastic colonic tissue and the second sample is of a normal human colonic 
tissue, and wherein the protein is encoded by a transcript identified by a tag 
selected from the group consisting of those shown in Table 3; 

determining a poorer prognosis if the level of expression is 
found to be lower in the first sample than in the second sample. 

35. A method to aid in determining a prognosis for a patient with colon 
cancer, comprising the steps of: 

comparing the level of expression of at least one protein in a 
first tissue sample to a second sample, wherein the first sample is of a colonic 
cancer tissue and the second sample is of a normal human colonic tissue, and 
wherein the protein is encoded by a transcript identified by a tag selected from 
the group consisting of those shown in Table 2; 

determining a poorer prognosis if the level of expression is 
found to be higher in the first sample than in the second sample. 

36. A method to aid in determining a prognosis of a patient having 
pancreatic cancer, comprising the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
neoplastic pancreatic tissue and the second sample is of a normal human colon 
tissue, wherein said protein is encoded by a transcript identified by a tag 
selected from the group consisting of those shown in Table 4; 

determining a poorer prognosis if the level of expression is 
found to be higher in the first sample than in the second sample. 

37. A method to aid in providing a prognosis for a cancer patient, 
comprising the steps of: 

comparing the level of expression of at least one protein in a 
first sample of a tissue to a second sample, wherein the first sample is of a 
neoplastic tissue and the second sample is of a normal human tissue of the same 
tissue type, wherein said protein is encoded by a transcript identified by a tag 
selected from the group consisting of those shown in Table 5; 

determining a poorer prognosis if the level of expression is 
found to be higher in the first sample than in the second sample. 

38. A method of treating a cancer cell, comprising the step of: 
administering to a cancer cell an antibody which specifically 
binds to a protein encoded by a transcript identified by a tag selected from the 
group consisting of those shown in Tables 2, 4, and 5, wherein the antibody is 
linked to a cytotoxic agent. 

39. An antibody linked to a cytotoxic agent, wherein the antibody 
specifically binds to a protein encoded by a transcript identified by a tag 
selected from the group consisting of those shown in Tables 2, 4, and 5. 

40. A method of detecting colon cancer in a patient, comprising the steps 
of: 

comparing the level of at least one protein in a first body sample 
to a second body sample, wherein the first sample is a body sample of the 
patient and the second sample is of a normal human, wherein the protein is 
encoded by a transcript identified by a tag selected from the group consisting 
of those shown in Table 2, wherein the first and second body sample is a 
sample selected from the group consisting of blood, urine, feces, sputum, and 
serum; 

identifying neoplasia when the level of the at least one protein 
is found to be higher in the first sample than in the second sample. 

41. A method of detecting pancreatic cancer in a patient, comprising the 
steps of: 

comparing the level of at least one protein encoded by a 
transcript in a first sample of a tissue to a second sample, wherein the first 
sample is of the patient and the second sample is of a normal human, wherein 
said protein is encoded by a transcript identified by a tag selected from the 
group consisting of those shown in Table 4, wherein the first and second sample 
is a sample selected from the group consisting of blood, urine, feces, sputum, 
and serum; 

identifying neoplasia when the level of the at least one protein 
is found to be higher in the first sample than in the second sample. 

42. A method of detecting cancer in a patient, comprising the steps of: 

comparing the level of at least one protein in a first sample to 
a second sample, wherein the first sample is of the patient and the second sample 
is of a normal human, wherein said protein is encoded by a transcript identified 
by a tag selected from the group consisting of those shown in Table 5, wherein 
the first and second body sample is a sample selected from the group consisting 
of blood, urine, feces, sputum, and serum; 

identifying neoplasia when the level of the at least one protein 
is found to be higher in the first sample than in the second sample. 

43. A method to aid in determining a prognosis for a patient with colon 
cancer, comprising the steps of: 

comparing the level of at least one protein in a first sample to 
a second sample, wherein the first sample is of a colonic cancer patient and the 
second sample is of a normal human, wherein the protein is encoded by a 
transcript identified by a tag selected from the group consisting of those shown 
in Table 2, wherein the first and second sample is a sample selected from the 
group consisting of blood, urine, feces, sputum, and serum; 

determining a poorer prognosis if the level of the at least one 
protein is found to be higher in the first sample than in the second sample. 

44. A method to aid in determining a prognosis of a patient having 
pancreatic cancer, comprising the steps of: 

comparing the level of at least one protein in a first sample to 
a second sample, wherein the first sample is of a pancreatic cancer patient and 
the second sample is of a normal human, wherein said protein is encoded by a 
transcript identified by a tag selected from the group consisting of those shown 
in Table 4, wherein said first and second sample is a sample selected from the 
group consisting of blood, urine, feces, sputum, and serum; 

determining a poorer prognosis if the level of the at least one 
protein is found to be higher in the first sample than in the second sample. 

45. A method to aid in providing a prognosis for a cancer patient, 
comprising the steps of: 

comparing the level of expression of at least one protein in a 
first sample to a second sample, wherein the first sample is of a cancer patient 
and the second sample is of a normal human, wherein said protein is encoded 
by a transcript identified by a tag selected from the group consisting of those 
shown in Table 5, wherein the first and second sample is a sample selected from 
the group consisting of blood, urine, feces, sputum, and serum; 

determining a poorer prognosis if the level of the at least one 
protein is found to be higher in the first sample than in the second sample. 

46. A method of detecting colon cancer in a patient, comprising the steps 
of: 

comparing the level of at least one transcript in a first body 
sample to a second body sample, wherein the first sample is a body sample of 
the patient and the second sample is of a normal human, wherein the transcript 
is identified by a tag selected from the group consisting of those shown in 
Table 2, wherein the first and second body sample is a sample selected from the 
group consisting of blood, urine, feces, sputum, and serum; 

identifying neoplasia when the level of the at least one transcript 
is found to be higher in the first sample than in the second sample. 

47. A method of detecting pancreatic cancer in a patient, comprising the 
steps of: 

comparing the level of at least one transcript in a first sample of 
a tissue to a second sample, wherein the first sample is of the patient and the 
second sample is of a normal human, wherein said transcript is identified by a 
tag selected from the group consisting of those shown in Table 4, wherein the 
first and second sample is a sample selected from the group consisting of 
blood, urine, feces, sputum, and serum; 

identifying neoplasia when the level of the at least one transcript 
is found to be higher in the first sample than in the second sample. 

48. A method of detecting cancer in a patient, comprising the steps of: 

comparing the level of at least one transcript in a first sample to 
a second sample, wherein the first sample is of the patient and the second sample 
is of a normal human, wherein said transcript is identified by a tag selected 
from the group consisting of those shown in Table 5, wherein the first and second 
body sample is a sample selected from the group consisting of blood, urine, 
feces, sputum, and serum; 

identifying neoplasia when the level of the at least one transcript 
is found to be higher in the first sample than in the second sample. 

49. A method to aid in determining a prognosis for a patient with colon 
cancer, comprising the steps of: 

comparing the level of at least one transcript in a first sample to 
a second sample, wherein the first sample is of a colonic cancer patient and the 
second sample is of a normal human, wherein the transcript is identified by a 
tag selected from the group consisting of those shown in Table 2, wherein the 
first and second sample is a sample selected from the group consisting of 
blood, urine, feces, sputum, and serum; 

determining a poorer prognosis if the level of the at least one 
transcript is found to be higher in the first sample than in the second sample. 

50. A method to aid in determining a prognosis of a patient having 
pancreatic cancer, comprising the steps of: 

comparing the level of at least one transcript in a first sample to 
a second sample, wherein the first sample is of a pancreatic cancer patient and 
the second sample is of a normal human, wherein said transcript is identified by 
a tag selected from the group consisting of those shown in Table 4, wherein said 
first and second sample is a sample selected from the group consisting of 
blood, urine, feces, sputum, and serum; 

determining a poorer prognosis if the level of the at least one 
transcript is found to be higher in the first sample than in the second sample. 

51. A method to aid in providing a prognosis for a cancer patient, 
comprising the steps of: 

comparing the level of expression of at least one transcript in 
a first sample to a second sample, wherein the first sample is of a cancer patient 
and the second sample is of a normal human, wherein said transcript is 
identified by a tag selected from the group consisting of those shown in Table 5, 
wherein the first and second sample is a sample selected from the group 
consisting of blood, urine, feces, sputum, and serum; 

determining a poorer prognosis if the level of the at least one 
transcript is found to be higher in the first sample than in the second sample. 

52. A method for screening for candidate agents that modulate the 
expression of a polynucleotide selected from the group consisting of the 
polynucleotides in SEQ ID NOS: 1-732 or their respective complements, 
comprising contacting a test agent with a colon or pancreatic cell and 
monitoring expression of the polynucleotide, wherein the test agent which 
modifies the expression of the polynucleotide is a candidate agent. 

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1,3,5,7,9,11,13-23,26,28,34,52 (partial) 

INVENTION 1: 

An isolated and purified human nucleic acid molecule 
comprising SEQ ID NO:291 of table 3 (INVENTION 1), an 
isolated nucleotide probe hybridizing to it, a diagnostic 
reagent comprising at least two of such probes, methods of 
diagnosing or prognosing colon cancer using them, and a 
method for screening for agents modulating the expression of 
such a sequence using them. 



2. Claims: 1,3,5,7,9,11,13-23,26,28,34,52 (partial) 
INVENTION 2 to INVENTION 259: 

An isolated and purified human nucleic acid molecule 
comprising SEQ ID NO:292 of table 3 (INVENTION 2), an 
isolated nucleotide probe hybridizing to it, a diagnostic 
reagent comprising at least two of such probes, methods of 
diagnosing or prognosing colon cancer using them, and a 
method for screening for agents modulating the expression of 
such a sequence using them. ... 



...ibidem for each of the SEQ ID Nos:293 to 549 (INVENTION 3 
to INVENTION 259) as specified in table 3, separately. 



3. Claims: 2,4,6,8,10,12-23,27,29,35,38-40,43,46,49, 
52 (partial) 



INVENTION 260 to INVENTION 549: 

An isolated and purified human nucleic acid molecule 
comprising SEQ ID NO:1 of table 2 (INVENTION 260), an 
isolated nucleotide probe hybridizing to it, a diagnostic 
reagent comprising at least two of such probes, methods of 
diagnosing or prognosing colon cancer using them, a method 
of treating a cancer cell using them, an antibody linked to 
a cytotoxic agent used in such a method, and a method for 
screening for agents modulating the expression of such a 
sequence using them. 

...ibidem for each of the SEQ ID Nos:2 to 290 (INVENTION 261 
to INVENTION 549) as specified in table 2, separately. 

INVENTION 550 to INVENTION 732: 

An isolated and purified human nucleic acid molecule 
comprising SEQ ID NO:550 of table 4 (INVENTION 550), an 
isolated nucleotide probe hybridizing to it, a diagnostic 
reagent comprising at least two of such probes, methods of 
diagnosing or prognosing pancreatic cancer using them, a 
method of treating a cancer cell using them, an antibody 
linked to a cytotoxic agent used in such a method, and a 
method for screening for agents modulating the expression of 
such a sequence using them. 

...ibidem for each of the SEQ ID Nos:551 to 732 (INVENTION 
551 to INVENTION 732) as specified in table 4, separately. 



5. Claims: 24,30,32,36,38,39,41,44,47,50 (partial) 
INVENTION 733 to INVENTION 734: 

Methods of diagnosing or prognosing pancreatic cancer 
relying on a human nucleic acid molecule comprising SEQ ID 
NO:733 of table 4 (INVENTION 733), a method of treating a 
cancer cell using it, and an antibody linked to a cytotoxic 
agent used in such a method. 

...ibidem for SEQ ID No:734 (INVENTION 734) as specified in 
table 4. 



6. Claims: 25,31,33,37-39,42,45,48,51 (partial) 
INVENTION 735 to INVENTION 870: 

Methods of diagnosing or prognosing cancer relying on a 
human nucleic acid molecule comprising SEQ ID NO:735 of 
table 5 (INVENTION 735), a method of treating a cancer cell 
using it, and an antibody linked to a cytotoxic agent used 
in such a method. 



...ibidem for each of the SEQ ID Nos:736 to 870 (INVENTION 
736 to INVENTION 870) as specified in table 5, separately. 